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REMARKS 

Applicants have amended the specification to recognize the Trademark ATCC . 

Applicants have cancelled Claims 1-3 without prejudice to, or disclaimer of, the subject 
matter contained therein. Applicants maintain that the cancellation of a claim makes no 
admission as to its patentability and reserve the right to pursue the subject matter of the cancelled 
claim in this or any other patent application. 

Applicants have amended Claim 4 to be in independent form, and have amended Claims 
5 and 12 to depend from Claim 4. Claim 13 is amended to replace the term "epitope tag" with 
the term "tag polypeptide." New Claims 14-17 have been added. Thus, Claims 4-8 and 11-17 
are presented for further examination. 

Applicants submit that no new matter was added by the amendments, and that support for 
the amendments can be found throughout the specification. Support for the amendments to 
Claim 13 can be found, for example, at paragraph [0229]. Support for new Claims 14-17 can be 
found, for example, in the claims as originally filed and paragraphs [0336], [0362], [407], and 
Example 18 starting at paragraph [0529]. 

Applicants thank the Examiner for the review of the instant application, and withdrawing 
the objections to the claims, title and specification. Applicants acknowledge the Examiner's 
withdrawal of the rejection of Claims 1-13 under 35 U.S.C. § 112, second paragraph, as being 
indefinite. The rejections of the presently pending claims are respectfully traversed. 
Information Disclosure Statement 

The Examiner has stated that the new patent listed in the information disclosure statement 
filed October 4, 2004 (mailed by Applicants on September 30, 2004), has not been fully 
considered since Applicants did not submit the requisite fee. Applicants submit that the October 
4, 2004 form PTO-1449 did not disclose a "new" patent, but rather corrected a typographical 
error in the form PTO-1449 submitted on September 6, 2002, as explained in the September 30, 
2004 Office Action Response. A copy of the correct patent was submitted with the original 
form PTO-1449 in September, 2002. Thus, Applicants are not asking the PTO to consider a 
"new" patent. As such, Applicants believe that no fee is due, and respectfully request 
consideration of the patent on file. 



-11- 



Appl. No. : 10/063,563 

Filed : May 2, 2002 



Specification 

Trademarks: 

The Examiner has noted that while the use of trademarks is permissible in patent 
applications, the trademarks should be capitalized and accompanied by the generic terminology. 
The Examiner has pointed out that "ATCC" is a trademark, and needs to be recognized wherever 
it appears. Applicants have amended the specification accordingly. 
Priority 

The Examiner maintains that Applicants are not entitled to a priority date earlier than the 
May 2, 2002 filing date of the instant application. Specifically, the Examiner argues that the 
provisional application filed June 24, 1998 only describes SEQ ID NO: 56 and fails to show 
utility and enablement for the presently claimed polypeptides. Similarly, the Examiner asserts 
that Applicants are not entitled to the priority filing date of August 24, 2000, in which the data 
presented in Example 18 were first disclosed. According to the Examiner, the PCT application 
does not comply with 35 U.S.C. § 120 written description, utility and enablement. 

For the reasons set forth below, Applicants submit that the presently claimed invention 
meets the requirements set forth in 35 U.S.C. §§ 101 and 112, first paragraph. Accordingly, 
Applicants are entitled to a priority date of at least August 24, 2000. 
Rejection Under 35 U.S.C. S101 - Utility 

The Examiner maintains the rejection of Claims 1-8 and 11-13 under 35 U.S.C. § 101 as 
lacking utility for the reasons set forth on pages 5-7 of the previous Office Action. The Examiner 
asserts that further research is necessary to confirm a "real world" utility for the claimed 
polypeptides, and thus they lack substantial utility. The Examiner alleges that Applicants have 
provided a single analysis of [a] nucleic acid without any relative range for basing a utility of under- 
expression for the claimed polypeptides, and that no levels (relative or absolute) are disclosed. The 
Examiner asserts that applicants have provided "no guidance on how to use [the data presented in 
Example 18]." Office Action at 5. 

Next, the Examiner asserts that the functional language "wherein the nucleic acid encodes a 
polypeptide that is more highly expressed in normal skin" does not confer utility on the claimed 
polypeptides because there is no data regarding protein expression in melanoma and normal skin. 
The Examiner maintains that Applicants cannot rely on a correlation between mRNA levels and 
increased polypeptide levels to establish that PRO 1027 is more highly expressed in normal skin than 
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in melanoma. According to the Examiner "[t]he art clearly establishes that DNA copy number, 
mRNA levels and protein levels are not inexorably related, in that an increase in one necessarily 
leads to an increase in all of them. Transcription levels (mRNA) do not correlate with polypeptide 
levels ." Id. (emphasis added). The Examiner relies upon six references to support the proposition 
that "one skilled in the art would not associate DNA copy number, mRNA and protein levels and 
necessarily reflecting each other" [sic]: (1) Haynes et al. (1998); (2) an excerpt from Lewin, Genes 
VI (1997); (3) Gokman-Polar et al. (2001); (4) Pennica et al. (1998); (5) Hu et al. (2003); and (6) 
Konopka et al (1986). 

The Examiner also alleges that the data in the specification is not scientifically significant 
and does not teach the expression level of the claimed protein. The Examiner alleges that the 
data in Example 18 is a preliminary observation that requires "substantive experimentation to 
ascertain the veracity thereof." Office Action at 7. Further, the Examiner states that the visual 
quantification using ethidium bromide staining of PCR products on gels is "not persuasive" and 
is "highly subjective." Office Action at 8. Further, according to the Examiner, without knowing 
the range of variation of transcript levels, there is insufficient guidance as to the substantial 
utility of the claimed invention. Office Action at 9. 

The Examiner rejects Applicants assertion that regardless of whether the polypeptide is 
over-expressed in melanoma, the polypeptide has utility based on the differential expression of 
the encoding mRNA. Specifically, the Examiner argues that unlike Her2/Neu, no mutation or 
translocation of the gene encoding PRO 1027 has been associated with tumor formation or the 
development of cancer. 
Utility - Legal Standard 

According to the Utility Examination Guidelines ("Utility Guidelines"), 66 Fed. Reg. 
1092 (2001) an invention complies with the utility requirement of 35 U.S.C. § 101, if it has at 
least one asserted "specific, substantial, and credible utility" or a "well-established utility." 

Under the Utility Guidelines, a utility is "specific" when it is particular to the subject 
matter claimed. For example, it is generally not enough to state that a nucleic acid is useful as a 
diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "The basic 
quid pro quo contemplated by the Constitution and the Congress for granting a patent monopoly 
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is the benefit derived by the public from an invention with substantial utility." In explaining the 
"substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office personnel must 
be careful not to interpret the phrase "immediate benefit to the public" or similar formulations 
used in certain court decisions to mean that products or services based on the claimed invention 
must be "currently available" to the public in order to satisfy the utility requirement. "Rather, any 
reasonable use that an applicant has identified for the invention that can be viewed as providing 
a public benefit should be accepted as sufficient, at least with regard to defining a 'substantial' 
utility." (M.P.E.P. § 2107.01, emphasis added). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is useful for any particular 
practical purpose ... and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Utility need NOT be Proved to a Statistical Certainty - a Reasonable Correla tion between the 

Evidence and the Asserted Utility is Sufficient 

An Applicant's assertion of utility creates a presumption of utility that will be sufficient to 

satisfy the utility requirement of 35 U.S.C. § 101, "unless there is a reason for one skilled in the 

art to question the objective truth of the statement of utility or its scope." In re Langer, 503 F.2d 

1380, 1391, 183 USPQ 288, 297 (CCPA 1974). See, also In re Jolles, 628 F.2d 1322, 206 USPQ 

885 (CCPA 1980); In re Irons, 340 F.2d 974, 144 USPQ 351 (1965); In re Sichert, 566 F.2d 

1154, 1159, 196 USPQ 209, 212-13 (CCPA 1977). Compliance with 35 U.S.C. § 101 is a 

question of fact. Raytheon v. Roper, 724 F.2d 951, 956, 220 USPQ 592, 596 (Fed. Cir. 1983) 

cert, denied, 469 US 835 (1984). The evidentiary standard to be used throughout ex parte 

examination in setting forth a rejection is a preponderance of the evidence, or "more likely than 

not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 USPQ2d 1443, 1444 (Fed. Cir. 1992). 

This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a 
whole, it leads a person of ordinary skill in the art to conclude that the asserted 
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utility is more likely than not . M.P.E.P. at § 2107.02, part VII (2004) (underline 
emphasis in original, bold emphasis added, internal citations omitted). 

The PTO has the initial burden to offer evidence "that one of ordinary skill in the art 
would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 
1436 (Fed. Cir. 1995). Only then does the burden shift to the Applicant to provide rebuttal 
evidence. Id. As stated in the M.P.E.P., such rebuttal evidence does not need to absolutely prove 
that the asserted utility is real. Rather, the evidence only needs to be reasonably indicative of the 
asserted utility. 

In Fujikawa v. Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996), the Court 

of Appeals for the Federal Circuit upheld a PTO decision that in vitro testing of a novel 

pharmaceutical compound was sufficient to establish practical utility, stating the following rule: 

[TJesting is often required to establish practical utility. But the test results need 
not absolutely prove that the compound is pharmacologically active. All that is 
required is that the tests be "reasonably indicative of the desired 
[pharmacological] response." In other words, there must be a sufficient 
correlation between the tests and an asserted pharmacological activity so as to 
convince those skilled in the art, to a reasonable probability, that the novel 
compound will exhibit the asserted pharmacological behavior." Fujikawa v. 
Wattanasin, 93 F.3d 1559, 1564, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996) (internal 
citations omitted, bold emphasis added, italics in original). 

While the Fujikawa case was in the context of utility for pharmaceutical compounds, the 
principals stated by the Court are applicable in the instant case where the asserted utility is for a 
therapeutic and diagnostic use - utility does not have to be established to an absolute certainty, 
rather, the evidence must convince a person of skill in the art "to a reasonable probability." In 
addition, the evidence need not be direct, so long as there is a "sufficient correlation" between 
the tests performed and the asserted utility. 

The Court in Fujikawa relied in part on its decision in Cross v. Iizuka, 753 F.2d 1040, 
224 U.S.P.Q. 739 (Fed. Cir. 1985). In Cross, the Appellant argued that basic in vitro tests 
conducted in cellular fractions did not establish a practical utility for the claimed compounds. 
Appellant argued that more sophisticated in vitro tests using intact cells, or in vivo tests, were 
necessary to establish a practical utility. The Court in Cross rejected this argument, instead 
favoring the argument of the Appellee: 
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[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. [Appellee] has not urged, 
and rightly so, that there is an invariable exact correlation between in vitro test 
results and in vivo test results. Rather, [Appellee's] position is that successful in 
vitro testing for a particular pharmacological activity establishes a significant 
probability that in vivo testing for this particular pharmacological activity will be 
successful. Cross v. Iizuka, 753 F.2d 1040, 1050, 224 U.S.P.Q. 739 (Fed. Cir. 
1985) (emphasis added). 

The Cross case is very similar to the present case. Like in vitro testing in the 
pharmaceutical industry, those of skill in the field of biotechnology rely on the reasonable 
correlation that exists between gene expression and protein expression (see below). Were there 
no reasonable correlation between the two, the techniques that measure gene levels such as 
microarray analysis, differential display, and quantitative PCR would not be so widely used by 
those in the art. As in Cross, Applicants here do not argue that there is "an invariable exact 
correlation" between gene expression and protein expression. Instead, Applicants' position 
detailed below is that a measured change in gene expression in cancer cells establishes a 
"significant probability" that the expression of the encoded polypeptide in cancer will also be 
changed based on "a reasonable correlation therebetween." 

Taken together, these cases illustrate that the legal standard for demonstrating utility is a 
relatively low hurdle. An Applicant need only provide evidence such that it is more likely than 
not that a person of skill in the art would be convinced, to a reasonable probability, that 
the asserted utility is true. The evidence need not be direct evidence, so long as there is a 
reasonable correlation between the evidence and the asserted utility. The Applicant does not 
need to provide evidence such that it establishes an asserted utility as a matter of statistical 
certainty. 

Even assuming that the Examiner satisfied the initial burden of offering evidence that one 
of ordinary skill in the art would reasonably doubt the truth of the asserted utility, Applicants 
assert that they have met their burden of providing rebuttal evidence such that it is more likely 
than not those skilled in the art, to a reasonable probability, would believe that the claimed 
invention is useful as a diagnostic tool for cancer. 
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Substantial Utility 

Summary of Applicants ' Arguments and the Examiner 's Response 

In an attempt to clarify Applicants' argument, Applicants offer a summary of their 
argument and the disputed issues involved. Applicants assert that the claimed polypeptides have 
utility as diagnostic tools for cancer, particularly skin cancer. Applicants are not asserting that 
the claimed polypeptides necessarily provide a definitive diagnosis of cancer, but rather that they 
are useful, alone or in combination with other diagnostic tools to assist in the diagnosis of skin 
cancer. Applicants' asserted utility rests on the following argument: 

1. Applicants have provided reliable evidence that mRNA for the PRO 1027 polypeptide 
is more highly expressed in melanoma compared to normal skin; 

2. Applicants assert that it is well-established in the art that a change in the level of 
mRNA for a particular protein, e.g. an increase, generally leads to a corresponding change in the 
level of the encoded protein, e.g. an increase; 

3. Given Applicants' evidence that the level of mRNA for the PRO 1027 polypeptide is 
increased in melanoma compared to normal skin, it is likely that the PRO 1027 polypeptide is 
differentially expressed in melanoma and is therefore useful as a diagnostic tool to distinguish 
melanoma from normal skin. 

Applicants understand the Examiner to be making several arguments in response to 
Applicants' asserted utility: 

1. The Examiner challenges the reliability of the evidence reported in Example 18, and 
states that it does not provide information sufficient to allow a skilled artisan to diagnose any 
disease; 

2. The Examiner cites Haynes et al., Lewin (Genes VI), Gokman-Polar, Pennica, Hu, and 
Konopka et al. to support the position that the mRNA expression data does not provide utility to 
the claimed polypeptides because DNA, mRNA and protein levels do not necessarily correlate; 

3. The Examiner argues that because no mutation or translocation of PRO 1027 has been 
associated with melanoma, the disclosure is insufficient to meet the utility requirement; 

4. The Examiner argues that the specification and the art do not teach how the levels of 
PRO 1027 fit into characterization of tumors, or better determination of therapy. 

As detailed below, Applicants submit that the PTO has failed to meet its initial burden to 
offer evidence "that one of ordinary skill in the art would reasonably doubt the asserted utility." 
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In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). First, the Examiner has 
failed to offer any evidence to support its rejection of the data in Example 18 and the Declaration 
of Chris Grimaldi in support of these data. Second, the fact that no mutation or translocation of 
PRO 1027 has been associated with melanoma does not prevent its use as a diagnostic tool for the 
same. Third, Applicants submit that given the well-established correlation between a change in 
the level of mRNA with a corresponding change in the levels of the encoded protein, the 
PRO 1027 protein is likely differentially expressed in melanoma compared to normal skin. This 
provides utility for the PRO 1027 and related proteins as cancer diagnostic tools. Finally, even if 
the PTO has met its initial burden, Applicants have submitted enough rebuttal evidence to 
establish that it is more likely than not that a person of skill in the art would be convinced, to a 
reasonable probability, that the asserted utility is true. As stated above, Applicants' evidence 
need not be direct evidence, so long as there is a reasonable correlation between the evidence and 
the asserted utility. The standard is not statistical or absolute certainty. 

Applicants have established that the Gene Encoding the PRO1027 Polypeptide is Differentially 
Expressed in Certain Cancers compared to Normal Tissue 

Applicants first address the Examiner's argument that the evidence of differential 
expression of the gene encoding the PRO 1027 polypeptide melanoma is insufficient to support a 
substantial utility. 

The gene expression data in the specification, Example 18, shows that the mRNA 
associated with protein PRO 1027 was more highly expressed in melanoma compared to normal 
skin tissue. Gene expression was analyzed using standard semi-quantitative PCR amplification 
reactions of cDNA libraries isolated from different melanoma and normal skin tissue samples. 
Identification of the differential expression of the PRO1027 polypeptide-encoding gene in tumor 
tissue compared to the corresponding normal tissue renders the molecule useful as a diagnostic 
tool for the determination of the presence or absence of tumor. In support, Applicants previously 
submitted as Exhibit 1 a first Declaration of J. Christopher Grimaldi, an expert in the field of 
cancer biology. This declaration explains the importance of the data in Example 18, and how 
differential gene and protein expression studies are used to differentiate between normal and 
tumor tissue (see Declaration, paragraph 7). 
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In paragraph 5 of his declaration, Mr. Grimaldi states that the gene expression studies 
reported in Example 18 of the instant application were made from pooled samples of normal and 
of tumor tissues. Mr. Grimaldi explains that: 

The DNA libraries used in the gene expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely 
to be accurate than data obtained from a sample from a single individual That 
is, the detection of variations in gene expression is likely to represent a more 
generally relevant condition when pooled samples from normal tissues are 
compared with pooled samples from tumors in the same tissue type. 
(Paragraph 5) (emphasis added). 

In paragraphs 6 and 7, Mr. Grimaldi explains that the semi-quantitative analysis 
employed to generate the data of Example 18 is sufficient to determine if a gene is over- or 
under-expressed in tumor cells compared to corresponding normal tissue. He states that any 
visually detectable difference seen between two samples is indicative of at least a two-fold 
difference in cDNA between the tumor tissue and the counterpart normal tissue. Thus, the 
results of Example 18 reflect at least a two-fold difference in PRO 1027 mRNA levels between 
normal and tumor samples. He also states that the results of the gene expression studies indicate 
that the genes of interest "can be used to differentiate tumor from normal," thus establishing their 
reliability. He explains that "The precise levels of gene expression are irrelevant; what matters is 
that there is a relative difference in expression between normal tissue and tumor tissue." 
(Paragraph 7). Thus, since it is the relative level of expression between normal tissue and 
suspected cancerous tissue that is important, the precise level of expression in normal tissue is 
irrelevant. Likewise, there is no need for quantitative data to compare the level of expression in 
normal and tumor tissue. As Mr. Grimaldi states, "If a difference is detected, this indicates that 
the gene and its corresponding polypeptide and antibodies against the polypeptide are useful for 
diagnostic purposes, to screen samples to differentiate between normal and tumor." 

Nevertheless, the Examiner maintains that the Grimaldi Declaration is insufficient to 

overcome the rejection of Claims 1-8 and 11-13. The Examiner argues that Example 18 is 

insufficient because it does not teach how high the expression level is, the expression range for 

normal and tumor tissue, whether the results are statistically significant, or the number of 

samples that were used. The Examiner also argues that one cannot determine if the observed 

increase in PRO 1027 "nucleic acid" is due to a mutation, copy number differences (presumably 

DNA), or an increase in transcription rates. The Examiner concludes that the disclosure would 
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not enable one of skill in the art to differentiate amongst expression levels to diagnose any 
disease. The Examiner also alleges that the Grimaldi declaration is unpersuasive as it does not 
address the PRO 1027 polypeptide. 

Applicants submit that the declaration of Mr. Grimaldi is based on personal knowledge of 
the relevant facts at issue. Mr. Grimaldi is an expert in the field and conducted or supervised the 
experiments at issue. Applicants remind the Examiner that "[o]ffice personnel must accept an 
opinion from a qualified expert that is based upon relevant facts whose accuracy is not being 
questioned." PTO Utility Examination Guidelines (2001) (emphasis added). In addition, 
declarations relating to issues of fact should not be summarily dismissed as "opinions" without 
an adequate explanation of how the declaration fails to rebut the Examiner's position. In re Alton 
76 F.3d 1168 (Fed. Cir. 1996). The Examiner has not supplied any reasons or evidence to 
question the accuracy of the facts upon which Mr. Grimaldi based his opinion. Mr. Grimaldi has 
personal knowledge of the relevant facts, has based his opinion on those facts, and the Examiner 
has offered no reason or evidence to reject either the underlying facts or his opinion, other than 
summarily stating that visual detection is highly subjective. Applicants note that the statement 
that any visually detectable difference between two samples is indicative of at least a two-fold 
difference in cDNA is a statement of scientific fact. As discussed above, a declaration that is 
filed to address a question of fact must be considered by the Examiner and if the Examiner 
maintains his rejection, he must be able to explain why the declaration fails. In re Alton 76 F.3d 
1168 (Fed. Cir. 1996). In addition, Applicants provide herewith as Exhibit 1 a copy of page 122 
of the 2002-2003 New England Biolabs catalog. Exhibit 1 -shows DNA size markers of differing 
lengths run on an agarose gel. The column on the left provides the mass of each marker in 
nanograms and the column on the right provides the length of the marker. It is apparent that the 
band intensity of markers having mass differences of two fold are readily distinguishable by eye 
(See for example, the difference in band intensities of the 0.1 kb fragment present at 61ng and the 
0.5kb marker present at 124ng). Accordingly, Applicants maintain that the procedures used to 
detect differences in expression levels were sufficiently sensitive to detect two-fold differences. 

In light of the above, the Examiner should accept Mr. Grimaldi' s opinion with regard to 
his statement that "any visually detectable difference seen between two samples is indicative of 
at least a two-fold difference in cDNA between the tumor tissue and the counterpart normal 
tissue" and that the genes of interest "can be used to differentiate tumor from normal." Together, 
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these statements establish that there is at least a two-fold difference in expression, and that the 
results are reliable enough that they can be used to distinguish tumor from normal tissue. 
Furthermore, Applicants submit that the first Grimaldi declaration addresses PRO 1027, as well 
as the other PRO polypeptides listed in Example 18. Paragraphs 4-7 relate to the data reported in 
Example 18 of the specification. Example 18 teaches that the nucleic acids encoding various 
PRO polypeptides, including PRO 1027, are differentially expressed in normal tissue as compared 
to cancer tissue samples. As such, the first Grimaldi declaration addresses the claimed invention. 

In conclusion, Applicants submit that the evidence reported in Example 18, combined 
with the first Grimaldi Declaration, establish that there is at least a two-fold difference in 
PRO 1027 cDNA between melanoma and normal skin tissue. Therefore, it follows that 
expression levels of the PRO 1027 gene can be used to distinguish melanoma from normal skin. 
The Examiner has not offered any significant arguments or evidence to the contrary. 

As Applicants explain below, it is more likely than not that the PRO 1027 polypeptide is 
also differentially expressed in melanoma, and can therefore be used to distinguish melanoma 
from normal skin. This provides utility for the claimed polypeptides. 

Applicants have established that the Accepted Understanding in the Art is that there is a Direct 
Correlation between mRNA Levels and the Level of Expression of the Encoded Protein 

Applicants next turn to the second portion of their argument in support of their asserted 
utility - that it is well-established in the art that a change in the level of mRNA for a particular 
protein generally leads to a corresponding change in the level of the encoded protein; given 
Applicants' evidence of differential expression of the mRNA for the PRO 1027 polypeptide in 
melanoma compared to normal skin, it is more likely than not that the PRO1027 polypeptide is 
differentially expressed; and proteins differentially expressed in certain tumors have utility as 
diagnostic tools. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Applicants previously submitted a copy of a second Declaration by J. Christopher 
Grimaldi, an expert in the field of cancer biology (previously attached as Exhibit 2). This 
declaration also addresses the data in Example 18, and therefore specifically addresses Example 
18. As stated in paragraph 5 of the declaration, "Those who work in this field are well aware that 
in the vast majority of cases, when a gene is over-expressed... the gene product or polypeptide 
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will also be over-expressed.... This same principal applies to gene under-expression." Further, 

"the detection of increased mRNA expression is expected to result in increased polypeptide 

expression, and the detection of decreased mRNA expression is expected to result in decreased 

polypeptide expression. The detection of increased or decreased polypeptide expression can be 

used for cancer diagnosis and treatment." The references cited in the declaration and submitted 

herewith support this statement. 

Applicants also previously submitted a copy of the declaration of Paul Polakis, Ph.D. 

(previously attached as Exhibit 3), an expert in the field of cancer biology. As stated in 

paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases] and my knowledge of the relevant scientific literature, it is my 
considered scientific opinion that for human genes, an increased level of mRNA 
in a tumor cell relative to a normal cell typically correlates to a similar increase in 
abundance of the encoded protein in the tumor cell relative to the normal cell. In 
fact, it remains a central dogma in molecular biology that increased mRNA levels 
are predictive of corresponding increased levels of the encoded protein. 
(Emphasis added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scientific research, that "such reports 
are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." (Polakis Declaration, 
paragraph 6). 

The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Bruce Alberts, et ai, Molecular Biology of 
the Cell (3 rd ed. 1994) (submitted herewith as Exhibit 2) and (4 th ed. 2002) (submitted herewith 
as Exhibit 3). Figure 9-2 of Exhibit 2 shows the steps at which eukaryotic gene expression can 
be controlled. The first step depicted is transcriptional control. Exhibit 2 provides that "[f]or 
most genes transcriptional controls are paramount. This makes sense because, of all the possible 
control points illustrated in Figure 9-2, only transcriptional control ensures that no superfluous 
intermediates are synthesized." Exhibit 2 at 403 (emphasis added). In addition, the text states 
that "Although controls on the initiation of gene transcription are the predominant form of 
regulation for most genes , other controls can act later in the pathway from RNA to protein to 
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modulate the amount of gene product that is made." Exhibit 2 at 453 (emphasis added). Thus, as 
established in Exhibit 2, the predominant mechanism for regulating the amount of protein 
produced is by regulating transcription initiation. 

In Exhibit 3, Figure 6-3 on page 302 illustrates the basic principle that there is a 
correlation between increased gene expression and increased protein expression. The 
accompanying text states that "a cell can change (or regulate) the expression of each of its genes 
according to the needs of the moment - most obviously by controlling the production of its 
mRNA." Exhibit 3 at 302 (emphasis added). Similarly, Figure 6-90 on page 364 of Exhibit 3 
illustrates the path from gene to protein. The accompanying text states that while potentially 
each step can be regulated by the cell, " the initiation of transcription is the most common point 
for a cell to regulate the expression of each of its genes ." Exhibit 3 at 364 (emphasis added). 
This point is repeated on page 379, where the authors state that of all the possible points for 
regulating protein expression, " |"f|or most genes transcriptional controls are paramount ." Exhibit 
3 at 379 (emphasis added). 

Further support for Applicants' position can be found in the textbook, Genes VI, 
(Benjamin Lewin, Genes VI (1997)) (submitted herewith as Exhibit 4) which states "having 
acknowledged that control of gene expression can occur at multiple stages, and that production of 
RNA cannot inevitably be equated with production of protein, it is clear that the overwhelming 
majority of regulatory events occur at the initiation of transcription ." Genes VI at 847-848 
(emphasis added). 

Additional support is also found in Zhigang et aL, World Journal of Surgical Oncology 
2:13, 2004, submitted herewith as Exhibit 5. Zhigang studied the expression of prostate stem 
cell antigen (PSCA) protein and mRNA to validate it as a potential molecular target for diagnosis 
and treatment of human prostate cancer. The data showed "a high degree of correlation between 
PSCA protein and mRNA expression." Exhibit 5 at 6. Of the samples tested, 81 out of 87 
showed a high degree of correlation between mRNA expression and protein expression. The 
authors conclude that "it is demonstrated that PSCA protein and mRNA overexpressed in human 
prostate cancer, and that the increased protein level of PSCA was resulted from the upregulated 
transcription of its mRNA." Exhibit 5 at 11. Even though the correlation between mRNA 
expression and protein expression occurred in 93% of the samples tested, not 100%, the authors 
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state that "PSCA may be a promising molecular marker for the clinical prognosis of human Pea 
and a valuable target for diagnosis and therapy of this tumor." Id. 

Further, Meric et ah, Molecular Cancer Therapeutics, vol. 1, 971-979 (2002), submitted 
herewith as Exhibit 6, states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the level 
of mRNA, which can be attributable to either DNA amplification or to differences 
in transcription. Meric et ah at 971 (emphasis added). 

Those of skill in the art would not be focusing on differences in gene expression between cancer 
cells and normal cells if there were no correlation between gene expression and protein 
expression. 

Together, the declarations of Grimaldi and Polakis, the accompanying references, and the 
excerpts and references provided above all establish that the accepted understanding in the art is 
that there is a reasonable correlation between changes in gene expression and the level of the 
encoded protein. 

In response to the second Grimaldi Declaration, the Examiner states that unlike 
Her2/Neu, PRO 1027 "has not been associated with tumor formation or the development of 
cancer." Office Action at 10. The Examiner also maintains that regarding t(5;14) discussed in 
the second Grimaldi declaration, "[n]o mutation or translocation of PRO 1027 is known or 
established to occur." Id. The Examiner concludes that in the absence of any of the above 
information, the disclosure is insufficient to meet the utility requirement. Applicants submit that 
a lack of known role for PRO 1027 in cancer does not prevent its use as a diagnostic tool for 
cancer. The fact that there is no known translocation or mutation of PRO 1027 is irrelevant to 
whether its differential expression can be used to assist in diagnosis of cancer - one does not 
need to know why PRO 1027 is differentially expressed, or what the consequence of the 
differential expression is, in order to exploit the differential expression to distinguish tumor from 
normal tissue. In fact the Revised Interim Utility Guidelines promulgated by the PTO recognize 
that proteins which are differentially expressed in cancer have utility. (See the caveat in Example 
12 which state that the utility requirement is satisfied where a protein is expressed in melanoma 
cells but not on normal skin and antibodies against the protein can be used to diagnose cancer.) 
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In addition, while Applicants appreciate that actions taken in other applications are not binding 
on the PTO with respect to the present application, Applicants note that the PTO has issued 
several patents claiming differentially expressed polypeptides. {See, e.g., U.S. Patent No. 
6,414,117 and U.S. Patent No. 6,124,433, attached hereto as Exhibits 7 and 8.) Because it is 
more likely than not that the PRO 1027 protein is also differentially expressed in certain tumors, 
the protein can be used as a diagnostic tool for cancer. 

Thus, the Examiner's rejection of the second Grimaldi Declaration and Polakis 
Declaration because they are viewed as insufficient is misplaced. Accordingly, Applicants 
submit that they have offered sufficient evidence to establish that it is more likely than not that 
one of skill in the art would believe that because the PRO 1027 mRNA is more highly expressed 
in melanoma tumor compared to normal skin tissue, the PRO 1027 polypeptide will have the 
same expression pattern. This differential expression of the PRO 1027 and related polypeptides 
make them useful as diagnostic tools for melanoma. 

The Arguments made by the PTO are Not Sufficient to satisfy the PTO's Initial Burden of 
Offering Evidence "that one of ordinary skill in the art would reasonably doubt the asserted 
utility n 

As stated above, an Applicant's assertion of utility creates a presumption of utility that 

will be sufficient to satisfy the utility requirement of 35 U.S.C. § 101, "unless there is a reason 

for one skilled in the art to question the objective truth of the statement of utility or its scope." In 

re hanger, 503 F.2d 1380, 1391, 183 USPQ 288, 297 (CCPA 1974). The evidentiary standard to 

be used throughout ex parte examination in setting forth a rejection is a preponderance of the 

evidence, or "more likely than not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 USPQ2d 

1443, 1444 (Fed. Cir. 1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a 
whole, it leads a person of ordinary skill in the art to conclude that the asserted 
utility is more likely than not . M.P.E.P. at § 2107.02, part VII (2004) (underline 
emphasis in original, bold emphasis added, internal citations omitted). 

The PTO has the initial burden to offer evidence "that one of ordinary skill in the art 

would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 
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1436 (Fed. Cir. 1995). Only then does the burden shift to the Applicant to provide rebuttal 
evidence. Id. As stated in the M.P.E.P., such rebuttal evidence does not need to absolutely prove 
that the asserted utility is real . Rather, the evidence only needs to be reasonably indicative of the 
asserted utility. 

The Examiner has cited six references to establish "that one of ordinary skill in the art 
would reasonably doubt" that the disclosed polypeptide is differentially expressed in certain 
tumors and that the claimed polypeptides can be used as diagnostic tools. Applicants address 
each cited reference in turn. 

The Examiner asserts that Haynes et al. (1998, Electrophoresis, 19:1862-1871) found "no 
strong correlation between protein and transcript level" when studying protein expression in 
Saccharomyces cerevisiae. In particular, the Examiner points to the finding that, for some genes, 
equivalent mRNA levels translated into protein abundances which varied by more than 50-fold. 
The Examiner also points to the statement that protein levels cannot be accurately predicted from 
the level of the corresponding mRNA transcript. 

Applicants submit that Haynes does not contradict the utility or enablement of the instant 
claims. Haynes is a review article dealing with the art of proteome analysis. The assertions in 
Haynes cited by the Examiner were made in an effort to identify shortcomings in the art of 
mRNA quantification to argue for "proteome analysis to become an essential component in the 
comprehensive analysis of biological systems." Haynes, p. 1863. Haynes studied 80 selected 
samples from Saccharomyces cerevisiae, and reported "a general trend but no strong correlation 
between protein and transcript levels (Fig. 1)." Id. However, a cursory inspection of Fig. 1 
shows a clear correlation between the mRNA levels and protein levels measured. This 
correlation is confirmed by an inspection of the full-length research paper from which the data in 
Fig. 1 were derived, presented herein as Exhibit 9 (Gygi et al., Molecular and Cellular Biology, 
Mar. 1999, 1720-1730). Gygi states that "there was a general trend of increased protein levels 
resulting from increased mRNA levels," with a correlation coefficient of 0.935, indicating a 
strong correlation. Gygi, p. 1726. Moreover, Gygi also states that the correlation is especially 
strong for highly expressed mRNAs. Id. Considering that Example 18 of the specification 
shows over-expression of PRO 1027 mRNA in rectal tumor and normal lung, Haynes and Gygi 
actually provide strong evidence in support of a general correlation between mRNA and protein 
levels, and thus the utility of the claimed PRO1027 polypeptides and antibodies. 
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The 50-fold variation referred to by the Haynes and cited by the Examiner, does not in 
any way show the absence of a correlation between mRNA and protein levels, but rather 
identifies the outer limits of variability in the Authors 1 experiments. This variability may support 
the Authors' assertion that the amount of a particular protein cannot accurately predict the 
particular level of the corresponding mRNA transcript, but it does not suggest an absence of a 
general correlation between mRNA and protein levels. Moreover, Gygi states that the high 
degree of variability seen at low levels of mRNA (shown in inset of Fig. 1, Haynes p. 1863) is 
due to the fact that "the magnitude of the error in the measurement of mRNA levels is inversely 
proportional to the mRNA levels." Gygi, p. 1727. Considering that PRO1027 mRNA has been 
shown in Example 18 of the specification to be over-expressed in rectal tumor and normal lung, 
the variability identified by Haynes is even less applicable to establishing the absence of a 
correlation between mRNA and protein levels in the instant case. 

As stated above, the standard for utility is not absolute certainty, but rather whether one 
of skill in the art would be more likely than not to believe the asserted utility. Here, the utility of 
PRO 1027 polypeptides and antibodies as diagnostic tools does not require Applicants to show 
that mRNA levels correlate to protein levels in every case, but rather only that the correlation 
exists more often than not. The data presented in Haynes is not inconsistent with or 
contradictory to the utility or enablement of the instant claims. To the contrary, the data clearly 
show a general correlation between protein levels and mRNA levels, and thus support 
Applicants 1 assertion that such a general correlation exists. 

Even if Haynes supported the Examiner's argument, which it does not, one contrary 
example does not establish that one of skill in the art would find it is more likely than not there is 
no general correlation between mRNA level and protein levels. In fact, the working hypothesis 
among those skilled in the art, as illustrated by the evidence presented above by Applicants, is 
that there is a direct correlation between mRNA levels and protein levels. This is further 
supported by the statement in Haynes that "interpretations of quantitative mRNA expression 
profiles frequently implicitly or explicitly assume that for specific genes the transcript levels are 
indicative of the levels of protein expression." See, Haynes, p. 1863, first full paragraph. Haynes 
does not suggest there is no correlation between mRNA and protein levels, but rather points to 
what the authors believe are shortcomings of using mRNA quantification to predict protein 
levels; specifically, that mRNA levels may not accurately predict protein levels in each 
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particular instance. Considering the more likely than not standard for utility, Haynes' 
identification of reasons why proteomic analysis may be preferable in some cases does not 
contradict Applicants' evidence that there is a general correlation between mRNA and protein 
levels. 

The Examiner next offers the statement in Genes VI that, "having acknowledged that 
control of gene expression can occur at multiple stages, and that production of RNA cannot 
inevitably be equated with production of protein, it is clear that the overwhelming majority of 
regulatory events occur at the initiation of transcription," as evidence that mRNA and protein 
levels do not correlate. (Genes VI, Benjamin Lewin, 1997, Chapter 29 - Regulation of 
Transcription, 1 st page). The Examiner focuses on the statement that "production of RNA cannot 
inevitably be equated with production of protein." However, the Examiner ignores the statement 
that, "it is clear that the overwhelming majority of regulatory events occur at the initiation of 
transcription." Further, Lewin goes on to state that transcription of a gene "is a major control 
point: probably it is the most common level of regulation ." Id., emphasis added. This 
reference provides additional support for Applicants' position that that the accepted 
understanding in the art is that there is a reasonable correlation between gene expression and the 
level of the encoded protein. Applicants do not assert that protein expression is never regulated 
at the translation level (i.e., post-transcriptionally), but rather that, more often than not, regulation 
of protein expression occurs at the transcriptional level, making the level of mRNA a general 
indicator of the level of protein. Far from rebutting this assertion, Lewin in fact supports it by 
stating that regulation occurs at the transcriptional stage in the overwhelming majority of cases, 
and that transcription is the most common level of regulation. 

The Examiner also cites to a reference authored by Gokmen-Polar et al. (2001, Cancer 
Research, 61: 1375-1381) for the proposition that there is not a necessary correlation between 
increased mRNA levels and increased protein levels. Gokmen-Polar et al. analyzed the mRNA 
and protein levels of PKCa, PKCJ3I and PKCpiI in normal versus chemically induced 
preneoplastic colon tissue. The authors report that "PKC(3I and PKCa mRNA levels in [the 
preneoplastic cells] were decreased to a level commensurate with the decrease in protein 
expression of these isozymes ." Gokmen-Polar et al. at 1378 (emphasis added). The authors also 
report that level of PKCJ3II mRNA mid protein was increased in the preneoplastic cells, but due 
to the fact that the increase in protein levels was higher than the increase in mRNA levels, they 
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conclude that the changes in protein expression levels "could not be explained by transcriptional 
control alone ." Id. at 1380 (emphasis added). In other words, Gokmen-Polar et al. shows a 
correlation between mRNA and protein levels for each enzyme analyzed, and merely 
acknowledges the existence of post-transcriptional controls. Thus, rather than providing 
evidence that "transcription levels (mRNA) do not correlate with polypeptide levels" (Office 
Action at 5), Gokmen-Polar et al. disclose data that illustrate this correlation. 

The Examiner also cites Pennica et al. (1998, Proc. Nat. Acad. Sci. USA, 95: 14717- 
14722) for the proposition that "there is a lack of correlation between gene amplification and 
protein expression" and that "the asserted correlation between each gene, mRNA and 
corresponding protein in tumors is unpredictable and well-known to the skilled artisan." Office 
Action at 6. The evidence in Pennica of a lack of correlation between DNA copy number and 
mRNA levels says nothing about a lack of correlation between the level of mRNA and the level 
of protein expression - in fact, Pennica did not even examine protein expression. It is the 
correlation between mRNA level and the level of protein expression which is at issue here, not 
the correlation of gene copy number and mRNA levels. The data Applicants report in Example 
18 indicate there are more copies of the mRNA encoding PRO 1027 polypeptides in rectal tumor 
compared to normal rectum, and less PRO 1027 mRNA in lung tumor compared to normal lung, 
and thus whether gene amplification leads to increased mRNA is not at issue. Nothing in 
Pennica is contrary to Applicants' assertion that it is well-established in the art that the level of 
protein is positively correlated to the level of mRNA. Furthermore, the authors of Pennica 
expressly qualify the data upon which they based the statement relied upon by the Examiner for 
the proposition that the field is unpredictable. Specifically, the authors report that the " apparent 
amplification observed for WISP-2 may be caused by another gene in [the] amplicon." Pennica 
at 14722. Applicants submit that Pennica does not establish unpredictability regarding whether 
DNA copy number and mRNA levels correlate, which Applicants submit is not at issue. Rather, 
a full reading of Pennica reveals that the authors recognize the limitations of their experiments 
and expressly cautions readers from drawing the exact conclusion that the Examiner draws from 
their data. Thus, Applicants submit that Pennica does not establish that it is more likely than not 
that one skilled in the art would doubt Applicants' asserted utility. 

The Examiner cites Hu et al. (2003, J. Proteome Research, 2: 405-412) for the 
proposition that "it is not the norm that increased/decreased gene transcription results in 
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increased/decreased polypeptide levels." Office Action at 7. Applicants submit that Hu is 
uninformative regarding the correlation between gene expression level and protein levels. In Hu, 
the researchers used an automated literature-mining tool to summarize and estimate the relative 
strengths of all human gene-disease relationships published on Medline. They then generated a 
microarray expression dataset comparing breast cancer and normal breast tissue. Using their 
data-mining tool, they looked for a correlation between the strength of the literature association 
between the gene and breast cancer, and the magnitude of the difference in expression level. 
They report that for genes displaying a 5-fold change or less in tumors compared to normal, there 
was no evidence of a correlation between altered gene expression and a known role in the 
disease. See Hu at 411. However, among genes with a 10-fold or more change in expression 
level, there was a strong correlation between expression level and a published role in the disease. 
Id. at 412. Importantly, Hu reports that the observed correlation was only found among estrogen 
receptor-positive tumors, not ER-negative tumors. Id. 

The general findings of Hu are not surprising - one would expect that genes that have the 
greatest change in expression in a disease would be the first targets of research, and therefore 
have the strongest known relationship to the disease as measured by the number of reports of a 
connection in the literature. But this does not mean that genes, and their corresponding proteins, 
with a lower level of change in expression are not important or cannot be used as molecular 
markers of the disease. This is demonstrated by the fact that ER-negative tumors did not show a 
correlation. The correlation reported in Hu only indicates that the greater the change in 
expression level, the more likely it is that there is a published or known role for the gene in the 
disease, as found by their automated literature-mining software. Nowhere in Hu does it say that a 
lack of correlation in their study means that the genes, and their corresponding proteins, with a 
less than five-fold change in level of expression in cancer cannot serve as a molecular marker of 
cancer. Genes with lower levels of change in expression may or may not be the most important 
genes in causing the disease, but the genes and their corresponding proteins can still show a 
consistent and measurable change in expression. While such genes and polypeptides may or may 
not be good targets for further research, they can nonetheless be used as diagnostic tools. Thus, 
Hu does not refute the Applicants' assertion that the PRO 1027 gene, and its corresponding 
polypeptide, can be used as a cancer diagnostic tool because they are differentially expressed in 
certain tumors. 
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The Examiner cites Konopka et al. (1986, Proc. Nat Acad. ScL USA, 83: 4049-4052) in 
further support of the assertion that it is not the norm that levels of mRNA correlate with 
corresponding protein levels. As described for Pennica, the Examiner has confused the 
relationship between an increase in copy number of a gene and the level of mRNA on the one 
hand, with the relationship between mRNA expression and levels of the corresponding protein on 
the other. In particular, the Examiner cites the statement in Konopka that "protein expression is 
not related to amplification of the abl gene but to variation in the level of the bcr-abl mRNA 
produced from a single Phi template." As with Pennica, the results presented in Konopka 
actually present strong evidence in support of Applicants' position that there is a general 
understanding in the art that levels of mRNA correlate with levels of the corresponding proteins. 
Konopka analyzed the expression patterns of a gene associated with certain cancers. The authors 
show a wide variation in the levels of the protein in various cell types, and find that this variation 
can be attributed to the levels of the corresponding mRNA in each cell type. See, Konopka, p. 
4050. Konopka thus concludes, "these combined data suggest that differential bcr-abl mRNA 
expression from a single gene template is responsible for the variable levels of P210 c ' abl [the 
protein of interest] detected." Id., p. 4051. Thus, far from supporting the Examiner's assertion 
that it is not the norm that increased transcription leads to increased levels of the corresponding 
protein, Konopka strongly supports the opposite proposition asserted by Applicants - that the 
level of mRNA, more often than not, correlates with the level of the corresponding protein. 

Thus, the evidence presented by Applicants, including the declarations of Grimaldi and 
Polakis, the accompanying references, the excerpts from "Molecular Biology of the Cell", and 
the Zhigang et al. and Meric et al. articles establish that the accepted understanding in the art is 
that there is a reasonable correlation between gene expression and the level of the encoded 
protein. Further, the references cited by the Examiner as evidence that no such general 
understanding exists do not in any way support the Examiner's arguments. To the contrary, 
Haynes, Lewin, Gokmen-Polar, Pennica, Hu and Konopka each provide strong evidence of the 
correlation asserted by Applicants. Accordingly, Applicants respectfully submit that the totality 
of the evidence clearly supports the conclusion that one of skill in the art understands that, more 
likely than not, levels of mRNA directly correlate with levels of corresponding proteins. 

Given the lack of support for the Examiner's position, Applicants submit that the 
Examiner has not met its initial burden of overcoming the presumption that the asserted utility is 
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sufficient to satisfy the utility requirement. In light of the references cited by Applicants, and 
further in light of the supporting evidence in the references cited by the Examiner, Applicants 
submit that the evidence is sufficient to establish that one of skill in the art would be more likely 
than not to believe that there is a correlation between mRNA levels and polypeptide levels, and 
thus the claimed polypeptides can be used as diagnostic tools for skin cancer. 

Specific Utility 

The Asserted Substantial Utilities are Specific to the Claimed Polypeptides 

Applicants next address the Examiner's assertion that the asserted utilities are not specific 
to the claimed polypeptides related to PRO1027. Applicants respectfully disagree. 

Specific Utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01 I. Applicants submit that the evidence of differential expression of the 
PRO1027 gene and polypeptide in certain types of tumor cells, along with the declarations and 
references discussed above, provide a specific utility for the claimed polypeptides. 

As discussed above, there are significant data which show that it is more likely than not 
that the gene for the PRO 1027 polypeptide is more highly melanoma tumor as compared to 
normal skin. These data are strong evidence that the PRO 1027 gene and polypeptide are 
associated with melanoma. Thus, contrary to the assertions of the Examiner, Applicants submit 
that they have provided evidence associating the PRO 1027 gene and polypeptide with a specific 
disease. The asserted utility as a diagnostic tool for melanoma is a specific utility - it is not a 
general utility that would apply to the broad class of polypeptides. 

Conclusion 

The Examiner has asserted two arguments for why the claimed polypeptides lack utility: 
(1) that the data reported in Example 18 is not reliable and does not establish a correlation 
between the differential protein expression and melanoma; (2) that because there is no necessary 
correlation between gene copy number, mRNA levels and protein levels, the claimed 
polypeptides are not useful as diagnostic tools; 

Applicants have addressed each of these arguments in turn. 
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First, the Applicants provided a first Declaration of Chris Grimaldi stating that the data in 
Example 18 are real and significant. This declaration also indicates that given the relative 
difference in expression levels, the disclosed nucleic acids and corresponding polypeptides have 
utility as cancer diagnostic tools. The Examiner has not offered any substantial reason or 
evidence to question the data in Example 18, or the first Grimaldi Declaration. 

Second, Applicants have demonstrated that it is not necessary to know the cause or 
consequence of the differential expression of PRO 1027 nucleic acids and polypeptides in certain 
tumors in order to use them as diagnostic tools for cancer. 

Third, Applicants have shown that the second Grimaldi Declaration and Polakis 
Declaration, the accompanying references, as well as the excerpts and references cited above, 
demonstrate that it is well-established in the art that a change in mRNA levels generally 
correlates to a corresponding change in the encoded protein levels. The Examiner has not offered 
any substantial reason or evidence to question these declarations and supporting references. To 
the contrary, the Examiner has cited references that support the principle that mRNA levels 
generally correlate with protein levels. 

Finally, the Examiner asserts that there is no asserted specific utility. Applicants have 
pointed out that the substantial utilities described above are specific to the claimed polypeptides 
because the PRO 1027 gene and polypeptide are differentially expressed in melanoma cells 
compared to the corresponding normal skin cells. This is not a general utility that would apply to 
the broad class of polypeptides. 

Given the totality of the evidence provided, Applicants submit that they have established 

a substantial, specific, and credible utility for the claimed polypeptides as diagnostic tools. 

According to the PTO Utility Examination Guidelines (2001), irrefutable proof of a claimed 

utility is not required. Rather, a specific, substantial, and credible utility requires only a 

"reasonable" confirmation of a real world context of use. Applicants remind the Examiner that: 

A small degree of utility is sufficient . . . The claimed invention must only be 
capable of performing some beneficial function . . . An invention does not lack 
utility merely because the particular embodiment disclosed in the patent lacks 
perfection or performs crudely ... A commercially successful product is not 
required . . . Nor is it essential that the invention accomplish all its intended 
functions ... or operate under all conditions . . . partial success being sufficient to 
demonstrate patentable utility ... In short, the defense of non-utility cannot be 
sustained without proof of total incapacity. If an invention is only partially 
successful in achieving a useful result, a rejection of the claimed invention as a 
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whole based on a lack of utility is not appropriate. M.P.E.P. at 2107.01 (underline 
emphasis in original, bold emphasis added, citations omitted). 

Applicants submit that they have established that it is more likely than not that one of 
skill in the art would reasonably accept the utility for the claimed polypeptides relating to 
PRO 1027 set forth in the specification. In view of the above, Applicants respectfully request that 
the PTO reconsider and withdraw the utility rejection under 35 U.S.C. §101. 

Rejection under 35 U.S.C. S 112. first paragraph - Enablement 

The Examiner maintains the rejection of Claims 1-8 and 11-13 as containing subject 
matter which was not described in the specification in such a way as to enable one skilled in the 
art to use the invention. The Examiner maintains that because the claimed invention is not 
supported by either a specific, substantial and credible asserted utility, or a well-established 
utility, the claims are not enabled. 

Applicants submit that in the discussion of the rejection under 35 U.S.C. §101 above, 
Applicants have established a substantial, specific and credible utility for the claimed 
polypeptides. Applicants therefore request that the Examiner withdraw the enablement rejection 
under 35 U.S.C. §112, first paragraph. 

The Examiner has also rejected Claims 1-6 and 11-13 on the grounds that the 
specification allegedly lacks compete deposit information for the deposit of the full-length cDNA 
encoding the claimed polypeptide. The Examiner requests that Applicants provide a declaration 
or affidavit providing assurances that all restrictions upon public access to the ATCC™ 
accession number 203245 as specifically claimed will be irrevocably removed upon the grant of a 
patent. Although paragraph [446] of the specification states that permanent and unrestricted 
availability of the deposits will be provided upon issuance of the pertinent U.S. Patent or laying 
open to the public of any U.S. or foreign patent application, whichever comes first, Applicants 
provide the requested Declaration herewith. Applicants request that the Examiner withdraw the 
enablement rejection accordingly. 

Rejection under 35 U.S.C. § 112, first paragraph - Written Description 

The Examiner has maintained the rejection of Claims 1-5 and 12-13 under 35 U.S.C. 
§112, first paragraph, as allegedly containing subject matter which was not described in the 
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specification in such a way as to reasonably convey to one skilled in the relevant art that the 
Applicants, at the time the application was filed, had possession of the claimed invention. The 
Examiner asserts that the addition of the functional limitation "wherein said isolated nucleic acid 
encodes a polypeptide that is more highly expressed in melanoma relative to normal skin cells" 
does not describe the function of the protein and does not impart specific structural information. 
The Examiner also maintains that the specification does not teach variants of either the nucleic 
acid or polypeptide. According to the Examiner, the skilled artisan would not be able to readily 
envision the claimed genus. Further, the Examiner asserts that while the skilled artisan may 
envision changes to the polypeptide of SEQ ED NO: 56, one cannot envision what changes to the 
structure or what part of the structure is conserved. 
The Legal Standard for Written Description 

The well-established test for sufficiency of support under the written description 
requirement of 35 U.S.C. §112, first paragraph is whether the disclosure "reasonably conveys to 
artisan that the inventor had possession at that time of the later claimed subject matter." In re 
Kaslow, 707 F.2d 1366, 1375, 2121 USPQ 1089, 1096 (Fed. Cir. 1983); see also Vas-Cath, Inc. 
v. Mahurkar, 935 F.2d atl563, 19 USPQ2d at 1116 (Fed. Cir. 1991). The adequacy of written 
description support is a factual issue and is to be determined on a case-by-case basis. See e.g., 
Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 USPQ2d at 1116 (Fed. Cir. 1991). The factual 
determination in a written description analysis depends on the nature of the invention and the 
amount of knowledge imparted to those skilled in the art by the disclosure. Union Oil v. Atlantic 
Richfield Co., 208 F.3d 989, 996 (Fed. Cir. 2000). 
The Current Invention is Adequately Described 

As noted above, whether the Applicants were in possession of the invention as of the 
effective filing date of an application is a factual determination, reached by the consideration of a 
number of factors, including the level of knowledge and skill in the art, and the teaching 
provided by the specification. The inventor is not required to describe every single detail of 
his/her invention. An Applicant's disclosure obligation varies according to the art to which the 
invention pertains. The present invention pertains to the field of recombinant DNA/protein 
technology. It is well-established that the level of skill in this field is very high since a 
representative person of skill is generally a Ph.D. scientist with several years of experience. 
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Accordingly, the teaching imparted in the specification must be evaluated through the eyes of a 
highly skilled artisan as of the date the invention was made. 

As amended, the pending claims are related to isolated polypeptides having at least 95% 
or 99% amino acid sequence identity to several polypeptides related to SEQ ID NO: 56, and 
satisfy the limitation "wherein said isolated polypeptide is more highly expressed in melanoma 
compared to normal skin, or wherein said isolated polypeptide is encoded by a polynucleotide 
that is more highly expressed in melanoma compared to normal skin" or "wherein said isolated 
polypeptide or a fragment thereof can be used to generate an antibody which can be used to 
specifically detect the polypeptide of SEQ ID NO: 56 in skin tissue samples." 

Applicants maintain that there is no substantial variation within the species which fall 
within the scope of the amended claims, which require at least 95% or 99% amino acid sequence 
identity to the disclosed sequences related to SEQ ID NO: 56. Applicants note that the pending 
Claims are analogous to the claims discussed in Example 14 of the written description training 
materials. In Example 14, the written description requirement was found to be satisfied for 
claims relating to polypeptides having 95% homology to a particular sequence and possessing a 
particular catalytic activity, even though the applicant had not made any variants. Similarly, the 
pending claims also have very high sequence homology to the disclosed sequences and must 
share the same expression pattern in certain tumors, or share an epitope sufficient to generate 
antibodies which specifically detect the polypeptide of SEQ ID NO: 56 in skin tissue samples. 

In Example 14, the procedures for making variants were known in the art and the 
disclosure taught how to test for the claimed catalytic activity. Similarly, in the instant 
application, it is well known in the art how to make polypeptides with at least 95% amino acid 
sequence identity to the disclosed sequences. In addition, the specification discloses how to test 
to determine if the polypeptide or encoding nucleic acid is differentially expressed in melanoma, 
and how to make antibodies which specifically detect the polypeptide of SEQ ID NO: 56 in skin 
tissue samples. Like Example 14, the genus of polypeptides that have at least 95% or 99% amino 
acid sequence identity to the disclosed sequences will not have substantial variation. 

Furthermore, while Applicants appreciate that actions taken by the PTO in other 
applications are not binding with respect to the examination of the present application, 
Applicants note that the PTO has issued many patents containing claims to variant nucleic acids 
or variant proteins where the applicants did not actually make such nucleic acids or proteins. 
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Representative patents include U.S. VPatent No: 6,737,522, US. Patent No. 6,395,306, U.S. 



Patent No. 6,025,156, U.S. Patent No. 6,645,499, U.S. Patent No. 6,498,235, and U.S. Patent No. 
6,730,502 which are attached hereto as Exhibits 10-15. 

In conclusion, Applicants submit that they have satisfied the written description 
requirement for the pending claims based on the actual reduction to practice of SEQ ID NO: 56, 
by specifying a high level of amino acid sequence identity, by describing how to test for 
differential expression of the polypeptide and encoding nucleic acid, and by describing how to 
make antibodies to the disclosed sequence, all of which result in a lack of substantial variability 
in the species falling within the scope of the instant claims. Applicants submit that this 
disclosure would allow one of skill in the art to "recognize that the applicant was in possession of 
the necessary common attributes or features of the elements possessed by the members of the 
genus." Hence, Applicants respectfully request that the PTO reconsider and withdraw the written 
description rejection under 35 U.S.C. §1 12. 
Rejection under 35 U.S.C. S 102(b) 

The Examiner has maintained the rejection of Claims 1-7, 11 and 12 under 35 U.S.C. 
§ 102(b) as allegedly being anticipated by Rhodes et al. (1999). The Examiner has maintained 
the position that Applicants are not entitled to the claimed priority dates of June 24, 1998 or 
August 24, 2000, because the priority documents allegedly lack written description, enablement 
and utility for the reasons discussed above. The Examiner alleges that the Stemple Doctrine 
cannot be used to overcome the rejection set forth under 35 U.S.C. § 102(b). 

Applicants respectfully disagree. As previously noted, in In re Moore, which followed 
the rule set forth in In re Stemple, the CCPA expressly stated that "the determination of a 
practical utility when one is not obvious need not have been accomplished prior to the date of a 
reference unless the reference also teaches how to use the compound it describes ." In re Moore, 
170 USPQ 260, 267 (CCPA 1971). The Rhodes et al. May, 1999 sequence database submission 
that forms the basis of the Examiner's rejection disclosed amino acids 1-77 of SEQ ID NO: 56. 
The Examiner has maintained that the disclosure of SEQ ID NO: 56 does not comply with the 
utility or enablement requirements. In other words, under the Examiner's reasoning, Rhodes et 
al. does "not teach how to use the compound it describes." Accordingly, under Moore, since 
Rhodes et al. does not teach how to use SEQ ID NO: 56, Applicants' disclosure in 60/090,444, 
filed June 24, 1998, cannot be dismissed as a priority document due to its alleged lack of utility. 
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Applicants have demonstrated, by means of the disclosure in 60/090,444 that they were in 
possession of so much of the claimed invention, i.e., SEQ ED NO: 56, as disclosed in the Rhodes 
reference dated May 1, 1999. Thus, Applicants submit that the cited reference is not available as 
prior art under 35 U.S.C. § 102(b), and request that the rejection be withdrawn. 



In view of the above, Applicants respectfully maintain that claims are patentable and 
request that they be passed to issue. Applicants invite the Examiner to call the undersigned if any 
remaining issues may be resolved by telephone. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 1 1-1410. 



CONCLUSION 



Respectfully submitted, 



KNOBBE, MARTENS, OLSON & BEAR, LLP 




Customer No. 30,3 1 3 
(619) 235-8550 



1327167J 
030805 
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1 fjg of RNA Ladder 
was heated at 60°C 
for 3 minutes in 1XRNA 
Ladder Sample Buffer 
and visualized by 
ethidium bromide 
staining (1.0% TBE 
agarose gel). 



Description: The RNA Ladder is a set of 7 RNA 
molecules produced by in vitro transcription of a 
mixture of 7 linear DNA templates. The ladder sizes are: 
9000, 7000, 5000, 3000, 2000, 1000 and 500 bases. 
The 3000 base fragment is at double intensity to serve 
as a reference band. This ladder is suitable for use as an 
RNA size standard on denaturing or native agarose gels. 

Reagents Supplied with Ladders: 

2X RNA Ladder Sample Buffer (for use with native 
agarose gels) 

2X RNA Ladder Sample Buffer: 

2X TBE (pH 8.3), 13% ficoll (w/v). 0.01% bromophenol 
blue and 7 M urea. 

Concentration: 500 pg/ml. 

Storage Conditions: 20 mM KOAc (pH 4.5). Store 
at -70°C. For short term storage (< 1 week), ladder can 
be stored at -20°C. 

Notes on Use: 

To avoid ribonuclease contamination: wear gloves, use 
RNase-free water for gels and buffers, wash equipment 
with detergent and rinse thoroughly with RNase-free 
water. 

It is best to use freshly poured gels that are as thin as 
possible (i.e. 2-10 mm). Excessively long run times or 



high voltage can cause degradation of the bands on 
the gel. We recommend 4-8 volts/cm and running the 
bromophenol blue approximately 5 cm into the gel for 
good resolution. 

Adding ethidium bromide to agarose gels and running 
buffer at a final concentration of 0.5 jog/ml will 
effectively stain the bands during electrophoresis. 

Denaturing vs. Native Agarose Gels: 

It is common practice to electrophorese RNA on a fully 
denaturing agarose gel, such as one containing 
formaldehyde (1). However, in many cases it is possible 
to run RNA on a native agarose gel and obtain suitable 
results. In fact, it has been demonstrated that treatment 
of RNA samples in a denaturing buffer maintains the 
RNA molecules in a denatured state, during electro- 
phoresis, for at least 3 hours (2,3). The use of native 
agarose gels eliminates problems associated with toxic 
chemicals and the difficulties encountered when staining 
and blotting formaldehyde gels. 

References: 

(1) Sambrook, J., Fritsch, E. F. and Maniatis, T. (1989). 
Molecular Cloning: A Laboratory Manual, (2nd ed.j, 
(pp. 7.43-7.45). Cold Spring Harbor: Cold Spring 
Harbor Laboratory Press. 

(2) Liu, Y-C, Chou, Y-C. (1990) BiotechniquesV, 558. 

(3) Sandra Cook and Christina Marchetti, unpublished 
observations. 



2-Log DNA Ladder (0.1-10.0 kb) 
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10 yg of 2-Log 
DNA Ladder 
visualized by 
ethidium bromide 
staining on a 1.0% 
TBE agarose gel. 



Description: A number of proprietary plasmids are 
digested to completion with appropriate restriction 
enzymes to yield 19 bands suitable for use as molecular 
weight standards for agarose gel electrophoresis. This 
digested DNA includes fragments ranging from 100 bp 
to 10 kb. The 0.5, 1.0 and 3.0 kb bands have increased 
intensity to serve as reference points. 

Preparation: Double-stranded DNA is digested to 
completion with the appropriate restriction enzymes, 
phenol extracted and equilibrated to 10 mM Tris-HCI 
(pH 8.0) and 1 mM EDTA. 

Concentration: 1,000 uij/ml. 

Storage Conditions: 10 mM Tris-HCI (pH 8.0) 

and 1 mM EDTA. For long term storage, store at -20°C. 

2-Log DNA Ladder is stable for at least 3 months at 4°C. 

Note: All fragments have 4-base, 5' overhangs that can 
be end labeled using T4 Polynucleotide Kinase (NEB 
#M0201) or filled-in using DNA Polymerase I, Klenow 
Fragment (NEB #M0210) (1). Use a-[ 32 P] dATP or a-PP] 
dTTP for the fill-in reaction. 

Usage Recommendation: We recommend loading 
1 ng of the 2-Log DNA Ladder diluted in sample buffer. 
This ladder was not designed for precise quantification of 
DNA mass but can be used for approximating the mass of 
DNA in comparably intense samples of similar size. 



The approximate mass of DNA in each of the bands in 
our 2-Log DNA Ladder is as follows (assuming a 1 \jq 
loading): 



Fragment 


Base Pairs 


DNA Mass 


1 


10,002 


40 ng 


2 


8,001 


40 ng 


3 


6,001 


48 ng 


4 


5,001 


40 ng 


5 


4,001 


32 ng 


6 


3,001 


120 ng 


7 


2,017 


40 ng 


8 


1,517 


57 ng 


9 


1,200 


45 ng 


10 


1,000 


122 ng 


11 


900 


34 ng 


12 


800 


31 ng 


13 


700 


27 ng 


14 


600 


23 ng 


15a 
15b 


517 ^ 
500 ^ 


124 ng 


16 


400 


49 ng 


17 


300 


37 ng 


18 


200 


32 ng 


19 


100 


61 ng 



Reference: 

(1) Sambrook, J., Fritsch, E. F. and Maniatis, T. 
(1989). Molecular Cloning: A Laboratory Manual, 
(2nd ed.) t (pp. 10.51-10.67). Cold Spring Harbor: 
Cold Spring Harbor Laboratory Press. 
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Front coven The photograph shows a rat nerve cell 
in culture. It is labeled ( yellow) with a fluorescent 
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processes. Nerve terminals ( green ) from other 
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acts. If these ^ nov ceU proteins differ among cells to the same extent as the 
^ore abundant proteins, as is commonly assumed, only a small number of pro - 
!!L differences (perhaps several hundred) suffice to create very large differences 
jp^eil morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specializations-different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 4 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
trol), (4) selecting which mRNAs in the cytoplasm are translated by ribosomes 
(translational control), (5) selectively destabilizing certain mRNA molecules in 
the cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2) . 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 9-2, only transcrip- 
tional control ensures that no superfluous intermediates are synthesized. In the 
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Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only d 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control 



DNA-binding Motifs in Gene 
Regulatory Proteins 5 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 



Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics 6 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in proteins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 



Figure 9-3 Double -helical structure 
of DNA. The major and minor grooves 
on the outside of the double helix are 
indicated. The atoms are colored as 
follows: carbon, dark blue; nitrogen 
light blue; hydrogen, white; oxygen 
red; phosphorus, yellow. 
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Figure 9-71 A mechanism to explain 
both the marked deficiency of CG 
sequences and the presence of CG 
Islands in vertebrate genomes* A 
black line marks the location of an 
unmethylated CG dinucleotide in the 
DNA sequence, while a red line marks 
the location of a methylated CG 
dinucleotide. 
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Summary 

the many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to he transcribed in different cells. Since many spe- 
cialized animal cells can maintain their unique character when grown in culture, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other, this can create a flip-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism far cell memory. 

In eucaryotes gene transcription is generally controlled by combinations of gene 
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
regulation of many genes. 

in addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotic cells to regulate gene expression. In ver- 
tebrates DNA methylatipn also plays a part, mainly as a device to reinforce decisions 
about gene expression that are made initially by other mechanisms. 



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant 
form of regulation for most genes, other controls can act later in the pathway 
from RNA to protein to modulate the amount of gene product that is made. Al- 
though these posttranscriptional controls, which operate after RNA polymerase 
has bound to the gene's promoter and begun RNA synthesis, are less common 
foan transcriptional control, for many genes they are crucial. It seems that every 
step in gene expression that could.be controlled in principle is likely to be regu- 
kted under some circumstances for some genes. 

We consider the varieties of posttranscriptional regulation in temporal or- 
according to the sequence of events that might be experienced by an RNA 
Molecule after its transcription has begun (Figure 9-72). 
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Figure 9-72 Possible post- 
transcriptional controls on gene 
expression. Only a few of these 
controls are likely to be used for any 
one gene. 
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Chesney Medical Archives of The Johns Hopkins Medical 
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Papers, Special Collections, Oregon State University; Nicholas 
Poussin, courtesy ofArtToday.com; Barbara McOintock, 
© David Mickloa, 1983; Andrei Sakharov, courtesy of Elena 
Bonner; Frederick Sanger, © The Nobel Foundation, 1958.) 
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FROM DNATO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, ceils can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency; allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 5-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Figure 6-3 Genes can be express*) 
with different efficiencies. Gene A is 
transcribed and translated much mare 
efficient^ than gene B.This allows the 
amount of protein A In the cell to be 
much greater than that of protein B. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of Its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is soil written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two* respects; (1) the nucleotides in RNA are 
ribomufeotides-^haX is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deojryribose; (2) although, like DNA, RNA contains the 
bases adenine CA)> guanine (G), and cytosine (Q, it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like % can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U), It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6) . As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 

Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic Illustration of the type of 
conformational change In a protein that produces material for a cross-beta filament. (B) Diagram Illustrating 
the self-infectious nature of the protein aggregation that is central to prion diseases. PrP Is highly unusual 
because the mkfolded version of the protein, called PrP*, Induces the normal PrP protein It contacts' to 
change (ts conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that Is especially prone to aggregation, but because this structure Is 
not infectious In this way, It cannot spread from one animal to another. (Q Drawing of a cross-beta filament, 
a common type of protease-reslstant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions In a 0 sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP* showing the likely change of two OrheBces into four 
p-sqands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpell adapted from M. Sunde et ai.JL Mo/. Bid. 273:72^739. 
1 997; D, adapted from S.B. Frusiner. Trends Bfodiem. Scf. 21 :482-«7, 1 996.) 



animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow dis ease 0 ) from cattle to humans in Gre at Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an Infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious *protein-only inheritance" observed 
in yeast cells. 



There Are Many Steps From DNAto Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs, hi principle, any or all of the steps inFig- 
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Figure 6-90 The production of a 
protein by a eucaryotk cell. Hie final 
level of each protein in a eucaiyotic cell 
depends upon the efficiency of each step 
depicted. 



\5i 



IAAAA mRNA 
EXPORT poly* tail 



NUCLEUS, 



CYT0S0L 



mRNA 



IMITATION OF PROTEIN SYNTHESIS 




COMPLETION OF. PROTEIN SYNTHESIS 
■AUD PROTEIN FOLDING 



COOH 

PROTEIN DEGRADATION % 



COOH 



ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is die most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a targe ribon ucleopro tein assembly called a ribosome. The 
amino adds used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pain with the tRNAanttcodon. Each amino add is added to the 
^terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be ' 
controlled. Controls that operate at 
staps I through 5 are discussed ip this 
chapter. Step 6, the regulation of protein 
activity, Includes reversible activation or 
Inactfvatlon by protein phosphorylation' 
(discussed m Chapter 3) as we8 as 
irreversible Inactivadon by proteolytic 
degradation (discussed in Chapter 6)< 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can inprinciple be regulated. Thus 
a cell can control the proteins it makes by (I) controlling when and how often a 
given gene is transcribed (transcriptional control) , (2)' controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and detennining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and UNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of ceUs in multicellular organisms arise 
because different sets of genes are expressed. Moreover, ceils ca n change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although aU of the steps involved in expressing a gene can in prin- 
ciple be regulated, for most genes the initiation of RNA transcription 1s the most 
Important point of control 
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DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
;by a single signal. Many others are complex and act as tiny microprocessors, 
• tesponding to a variety of signals that they interpret and integrate to switch the 
[neighboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive . 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters of genes that remain active 
in the germ cell lineages * (including most housekeeping genes) are kept 
unraethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
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Summary 

The many types of cells in animals and plants are created largely, through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell . 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a ceU between two alternative 
patterns of gene expression. Director indirect positive feedback loops, which enable . 
gene regulatory proteins to perpetua te their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

tneucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in me regulation of marry genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactwation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylatlon also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms, DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 



CG island 



Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in .vertebrate 
genomes. A black fine marks die location 
of a CG ^nucleotide In the DNA 
sequence, while a red "lollipop" Indicates 
the presence of a methyl group on the 
CG dirtudeotide. CG sequences that lie In 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained In 
evolution. Methylated CG sequences, on 
the ofher hand, tend to be lost through 
deamination of 5-methyl C to T unless the 
GG sequence Is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only -a few of them. Controls on the initiation of 
gene transcription are die predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
die amount of gene product that is made. Although these posttranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control for 
many genes they are crucial 
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Regulation of transcription 



^tf phenoiypic diherences Unit distinguish the 
carious kinds or trlls in a higher ctifcnryole are 
largely due to' differences *ui the expression of 
. rA es thnl code fnr proteins, thnl is. those (ran* 
bribed hy RNA polymerase It. In principle. lit* 
^presslon of these penes might lie regulated at 
an v one or several stages. The concept or Hie 
-level or control" implies thai gene expression 
* 5s pot necessarily an automatic process once it 
tas begun. If could In* regulated m a gene- 
specific way al any one or several sequential 
jieps. We can distinguish <nt least) five poten- 
tial control points, funning the series: 

Activation of gene stnuiuiv 

Initiation of transriipitun 
i 

Processing the trsiuSiTinj 

Transport to cytoplasm 
1 

Translation of mRNA 

The existence or the firsl Mcp Is implied by 
the discovery Hint genes mny exist in either of 
im> structural conditions. Ilelnllvc to the stale 
of must of the gwrmie. genes are round in 
mi •aciive" stale hi the cells in which they 
are expressed (see Chapter 21). The change of 
structure Is. distinct frmn the ad oC transcrlp* 
lion, and indicates that Ihe gene is *transcrip- 
able." This suggests that acquisition of the 
"active* structure must be the first step in gene 
expression. 

transcription or a gene In the active stale is 



controlled al Ihe sins? of initiation, that Is. hy 
the interaction of UNA polymerase with its pro- 
moter. This Is now becoming susceptible to 
analysis In tike /// r///x> systems {see Chapter 
58). For most 5 cues, this is a major control 
point: proi»b1y it is the most common level of 
regulation. 

There is at present no evidence for control 
al subsequent stages of transcription in eubary* 
otic cells, for example* via a nt iter mi nation 
mechanisms. 

The primary transcript Is modified by capping 
at the S> end! and usually also by potyadenyla- 
tion at the 3' end. hurons must Ik spliced out 
Iruhi the transcripts of Interrupted genes. The 
mature RNA most he exported Prom the nucleus 
in tlie cytoplasm. Regulation or getie expression 
by selecllon or sequences at the level of nuclear 
RNA might Involve any or all or these stages, 
hut the one Tor which we have most evidence 
concerns changes In splicing; some genes are 
expressed by means of alternative splicing pot- 
terns whose regulation controls the type of pro- 
tein producl (see Chapter 30). 

Finally. Ihe translation of an mRNA In the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of litis mechanism in 
aduU somatic cells, but H dries occur in some 
embryonic situations, as described in'Chnpter 
"The mechanism is presumed to involve the block- 
ing of Initiation of translation or some mRNAs by 
specific protein factors. 

8ut having acknowledged that control of gene 
expression can occur at multiple singes, and 
that production of RNA cannot inevitably be 
equaled with production of protein, it Is clear 
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that the overwhelming majority or regulatory 
.events occur at the initiation of transcription. 
Regulation or tissue-specific gene transcription 
lies at the heart or eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a large number of .target 
genes, and we seek to answer two questions 
about this mode of regulation: whaj identifies 
the common target genes to the transcription 
factor, and how is the activity or the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



ttesponse elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter clement that is recognized 
by a regulatory transaction factor. An element 
that causes a gene to respond to such a factor 
is called a response element examples are the 
HSE (heat shock response element), CRE 
(glucocorticoid response dement), SE&E (scrum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in. Table 29.1, Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily Identical. The region bound by the factor 
extends for a short distance on either side of 



Table 29.1 Inducible transcrpton feelers twitf ic 
response cluincnls thai identify Qrcups cf rromcleis 
ot enhancsrs sutjeci to coordinate ccrtrcl. 



Reoutaory Agent Module Consensus 



Factor 



Heaithoch HSE CNNGAAHNTCCNNG HSTF 

Gfccocortfcott GRE TGGTACAAATGTTCT fttcvpior 

PhocbotBSttr TOE TGACTCA API 

Serum SHE CCATATTAGG SflF 



the consensus sequence, in promoters, the ele- 
ments are not present at fixed distances from 
the sJaripoint, but* are usually <200 bp upstream 
of It. The presence or a single element usuaUr 
is sufficient to confer the- regulatory response- 
but sometimes there are multiple copies. 

Response elements may be located. in V 10 " 
meters or in enhancers. Some rypes of element 
are typically found in one rather than the other, 
usually an HSE Is found In a promoter, while » 
ORE is found in an enhancer. We assume th* 
all response elements function by the ***** 
general principle. A gene is regulated fc? 0 
sequence at Uie promoter or enhancer (hat p 
recognized by a specific protein. The P*** 
functions as a transcription factor needed F 
WV:i polymerase to initial*.' Active protein 
available only under conditions when the. 
to be expressed; its absence means that the rT 
ntoter* is not activated by this particular o^J* 

An example or a situation in which » 
genes are controlled by a single factor & 
vided by the heat shock response. This & *JJJ 
mon to a wide range of prokiu^otes ^ 
eutaryoles and Involves multiple conlF ^ Hl rr 
gene expression; an increase in K* 1 P er * „* 
turns off transcription of some genes, turn ^ 
transcription of the heat shock 
causes changes in the translation of t^J 
The control of the heat shock genes 



the differences between prokary*^ ^ 
eukaryotic modes of control. In. bacterid 
Sigma factor is synthesized that <* lnrCl * ^r^ 
polymerase holoenzyme to recognize an. 
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Background: Prostate stem cell antigen (PSCA) is a recently defined homologue of the Thy-l/Ly-6 family of 
glycosylphosphatldylinositol (GPI)-anchored cell surface antigens. The purpose of the present study was to 
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treatment of Pea. 

Materials and Methods: Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia 
(BPH), 20 prostatic intraepithelial neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was semiquantitatively scored by assessing both the 
percentage and intensity of PSCA-positive staining ceils in the specimens. Then .compared PSCA expression 
between BPH. PIN and Pea tissues and analysed the correlations of PSCA expression level with pathological grade, 
clinical stage and progression to androgen-lndependence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense 
and uniform than that seen in HGP1N and Pea. There were moderate to strong PSCA protein and mRNA 
expression in 8 of II (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN (22.2%) samples (p < 0.05, respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < 0.05, respectively). In addition, IHC and ISH staining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and androgen- in dependence, and 
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular target for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer increasing in China. Despite recently 
great progress in the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers that can accurately discriminate between 
indolent and aggressive variants of Pea. There also contin- 
ues to be a need for the identification and characterization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lack of specific tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate specific antigen, prostatic acid 
phosphatase, glandular kallikrein 2), the majority of these 
are secreted proteins not ideally suited for many immuno- 
logical strategies. So, the identification of new cell surface 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea. 

Reiter RE et al [1] reported the identification of prostate 
stem cell antigen (PSCA), a cell surface antigen that is pre- 
dominantly prostate specific. The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs 
to a member of the Thy-I/Ly-6 family and is anchored by 
a glycosylphosphatidylinositol (GPI) linkage. mRNA in 
situ hybridization (ISH) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemistry (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin-embedded tissue speci- 
mens of benign prostatic hyperplasia (BPH, n = 20), pros- 
tate intraepithelial neoplasm (PIN, n = 20) and prostate 
cancer (Pea, n « 48). Furthermore, we evaluated the possi- 
ble correlation of PSCA expression level with Pea tumori- 
genesis, grade, stage and progression to androgen- 
independence. 

Materials and methods 
Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prostate- 
ctomy, transurethral resection of prostate (TURP) or biop- 
sies. The patients were classified as 20 cases of BPH, 20 
cases of PIN, 40 cases of primary Pea, including 9 patients 



with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight specimens were harvested from these andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut into two parts, one 
was fixed in 10% formalin for IHC and the other treated 
with 4% paraformaldehyde/0. 1 M PBS PH 7,4 in 0.1% 
DEPC for 1 h for ISH analysis, and then embedded in par- 
affin. All paraffin blocks examined were then cut into 5 
|im sections and mounted on the glass slides specific for 
IHC and ISH respectively in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Gleason score by the experienced urological pathologist at 
our institution based on the criteria of Gleason score [2]. 
The Gleason sums are summarized in Table 1. Clinical 
staging was performed according to Jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens into two groups, i.e. low 
grade PIN (grade I - II) and high grade PIN (HGPIN, 
grade III) on the basis of literatures [3,4]. 

Immunohistochemical (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, 
and subjected to microwaving in 10 mmol/L citrate 
buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval. Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1:100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially adding biotinylated secondary 
antibodies and strepavidin-peroxidase, and localized 
using 3,3'-diaminobenzidine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hybridization (ISH) 

Five-um-thick tissue sections were deparaffinized and 
dehydrated, then digested in pepsin solution (4 mg/ml in 
3% citric acid) for 20 min at 37.5 °C, and further proc- 
essed for ISH. Digoxigenin-Iabeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48 °C 
overnight. The posthybridization wash with a high strin- 
gency was performed sequentially at 37 0 C in 2 * standard 
saline citrate (SSC) for 10 min, in 0.5 x SSC for 15 min 
and in 0.2 x SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anti-digoxigenin antibody at 
37.5 °C fori h followed by washing in 1 x PBS for 20 min 
at room temperature, and then to strepavidin-peroxidase 
at 37.5°C for 20 min followed by washing in 1 x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Table I: Correlation of PSCA expression with Gleason score 



Intensity * frequency 

Gleason score 0-6 (%) 9 {%) 

2-4 5(83) I(I7) 

5-7 19 (79) 5 (21) 

8-10 5(28) 13(72) 



Table 2: Correlation of PSCA expression with clinical Stage 

Intensity * frequency 



Tumor stage 0-6 (%) 9 (%) 



SB 

S>C 


27(67.5) 
2(25) 


13(32.5) 
6(75) 





terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. AJ1 slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA 
immunosiaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by 1HC and PSCA mRNA staining 
by ISH. Each slide was read and scored by two independ- 
ently experienced urological pathologists using Olympus 
BX-41 light microscopes. The evaluation was done in a 
blinded fashion. For each section, five areas of similar 
grade were analyzed semiquantitatively for the fraction of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intraobserver concordance. There was greater 
than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed (0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 = posi- 
tive staining in <25% of the sample; 2 - positive staining 
in 25%-50% of the sample; 3 = positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
[1,5]. In this way, we were able to differentiate specimens 
that may have had focal areas of increased staining from 
those that had diffuse areas of increased staining [6]. The 
overall score for each specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expres- 
sion in BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's t-test. Univariate associations 
between PSCA expression and Gleason score, clinical 
stage and progression to androgen-independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression In BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
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2) in one and negative (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in all cases 
examined. 

PSCA expression In PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA (<2 scores) in 7 of 9 (77.8%) 
low grade PIN and in 2 of 11 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC. Strong PSCA pro- 
tein and mRNA expression (9 score) were detected in the 
remaining 3 of 1 1 (27.3%) HGPIN. There was a statisti- 
cally significant difference of PSCA protein and mRNA 
expression levels observed between HGPIN and BPH (p < 
0.05), but no statistical difference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression in Pea 

In order to determine if PSCA protein and mRNA can be 
detected in prostate cancers and if PSCA expression levels 
are increased in malignant compared with benign glands, 
Forty-eight paraffin-embedded Pea specimens were ana- 
lysed by IHC and ISH. It was shown that 19 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure 1). In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall Pea 
expressed a significantly higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respectively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea. 

Correlation of PSCA expression with Gleason score in Pea 

Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA with Gleason grade of 
Pea, as shown in Table 1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentia- 
tion, 5-7 scores = moderate-differentiation and 8-10 
scores = poor-differentiation (7). Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating that 
poorly differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted in 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 
and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade 
in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we 
showed the results in Table 2. Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed statistically high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 specimens of androgen-independent prostate can- 
cers stained positive for PSCA protein and mRNA. Eight 
specimens were obtained from patients managed prior to 
androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score - S) f compared with 
three out of eight (37.5%) of patients with androgen- 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significantly with progres- 
sion to androgen-independence of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significantly with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correlation of PSCA immunostalning and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
Positive staining areas and its intensity and density scores 
evaluated by IHC were identical to those seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importantly, 27/27 samples with 
PSCA mRNA composite scores of 0-2, 32/36 samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 there were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect posttranscriptional 
modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that the results of PSCA immunostaining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Representatives of PSCA IHC and ISH staining in Pea (A. IHC staining, B. ISH staining, *200 magnification). A„ B,: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (B,) showed no back- 
ground staining. A 2 . Bj: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score - 
6) in all malignant cells; A 2 : IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. A 3 , B 3 : a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) In all malignant cells. 
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Discussion 

PSCA is homologous to a group of cell surface proteins 
thai mark the earliest phase of hematopoietic develop- 
ment. PSCA mRNA expression is prostate-specific in nor- 
mal male tissues and is highly up-regulated in both 
androgen-dependent and-independent Pea xenografts 
(UPC-4 tumors). We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
low grade PIN tissues. However, PSCA protein and mRNA 
are widely expressed in HGPIN, the putative precursor of 
invasive Pea, suggesting that up-regulation of PSCA is an 
early event in prostate carcinogenesis. Recently, Reiter RE 
et al [ 1 ], using ISH analysis, reported that 9 7 of 1 1 8 (82%) 
HGPIN specimens stained strongly positive for PSCA 
mRNA. A very similar rinding was seen on mouse PSCA 
(mPSCA) expression in mouse HGPIN tissues by Tran C. 
P et al [8]. These data suggest that PSCA may be a new 
marker associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA are highly 
expressed in a large percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 
independent and metastatic cases. Fluorescence-activated 
cell sorting and confocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in 
Pea cells [9]. Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1). One pos- 
sible explanation for this is that anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. Also, it is possible that the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane [5]. These data seem to indicate that PSCA 
is a novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression 
correlates with high grade (i.e. poor differentiation), 
increased tumor stage and progression to androgen-inde- 
pendence of Pea. These findings support the original IHC 
analyses by Gu Z et al [9), who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al 
[10], in which the significant association between high 
PSCA expression and adverse prognostic features such as 
high Gleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H et al 
[11] used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa 
patients, each PCR indicated the prognostic value in the 
hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic 
cases with positive PSCA PCR indicated lower disease-pro- 
gression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic fac- 
tor. Dubey P et al [12] reported that elevated numbers of 
PSCA + cells correlate positively with the onset and devel- 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN +/- models com- 
pared with its normal prostates. Taken together with our 
present findings, in which PSCA is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
possible to use increased PSCA expression level or 
increased numbers of PSCA-positive cells in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or clinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplification. In humans, PSCA is located on chro- 
mosome 8q24.2 [1], which is often amplified in meta- 
static and recurrent Pea and considered to indicate a poor 
prognosis [13-15]. Interestingly, PSCA is in close proxim- 
ity to the c-myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers [16,17]. Reiter 
RE el al [18] reported that PSCA and MYC gene copy num- 
bers were co-amplified in 25% of tumors (five out of 
twenty), demonstrating that PSCA overexpression is asso- 
ciated with PSCA and MYC coamplification in Pea. Gu Z 
et al (9] recently reporteted that in 102 specimens availa- 
ble to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA expres- 
sion. Taken together with our findings, in which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present, the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors [18]. Watabe T et al |19] reported that transcriptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contact and pro- 
tein kinase C (PKC) [20]. Homologues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signalling through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21]. Thy-1 is involved in T cell activation and transducts 
signals through src-like tyrosine kinases [22]. Ly-6 genes 
have been implicated both in tumorigenesis and in cell- 
cell adhesion [23-25]. Cell-cell or cell-matrix interaction is 
critical for local tumor growth and spread to distal sites. 
From its restricted expression in basal cells of normal 
prostate and its homology to SCA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i.e. 
anti-apoptosis) and/or proliferation [1]. Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through affecting cell transformation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the detection of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dict metastatic progression better than current assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HGPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen-independent 
progression. PSCA protein overexpression is due to the 
upregulation of its mRNA transcription. The results sug- 
gest that PSCA may be a promising molecular marker for 
the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor. 
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Abstract 

Translation Initiation is regulated fn response to 
nutrient availability and mitogenlc stimulation and Is 
coupled with ceD cycle progression and cefl growth. 
Several atterations In translatiorta] control occur In 
cancer. Variant mRMA sequences can alter the 
translationai efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes in 
the expression or availability of components of the 
translationai machinery and In the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase In 
the overall rate of protein synthesis and translationai 
activation of the mRNA molecules Involved In cefl 
growth and proliferation. We review the basic 
principles of translationai control, the alterations 
encountered In cancer, and selected therapies 
targeting translation Initiation to help elucidate new 
therapeutic avenues. 

Introduction 

The fundamental principle of molecular therapeutics in can- 
cer Is to exploit the differences In gene expression between 
cancer ceils and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences In transcription. Gene expression is quite complicated, 
however, and Is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. 

The power of translationai regulation has been best recog- 
nized among developmental biologists, because transcription 
does not occur in early embryogenesis in eukaryotes. For ex- 
ample, in Xenopus, the period of transcriptional quiescence 
continues until the embryo reaches mldblastula transition, the 
4000-celi stage. Therefore, all necessary mRNA molecules ana 
transcribed during oogenesis and stockpiled in a transfattonalry 
Inactive, masked form. The mRNA are transfaConafly activated 
at appropriate times during oocyte maturation, fertilization, and 



early embryogenesis and thus, are under strict transnational 
control. 

Translation has an established role In cell growth. Basi- 
cally, an Increase In protein synthesis occurs as a conse- 
quence of mitogenesls. Until recently, however, little was 
known about the alterations in mRNA translation In cancer, 
and much Is yet to be discovered about their role In the 
development and progression of cancer. Here we review the 
basic principles of translationai control, the alterations en- 
countered in cancer, and selected therapies targeting transla- 
tion initiation to elucidate potential new therapeutic avenues. 

Basic Principles of Translationai Control 
Mechanism of Translation Initiation 
Translation initiation Is the main step In translationai regulation. 
Translation initiation is a complex process in which the initiator 
tRNA and the 40$ and 60S ribosomai subunlts are recruited to 
the 5' end of a mRNA molecule and assembled by eukaryotic 
translation Initiation factors into an 80S ribosome at the start 
codon of the mRNA (Rg. 1 ). The 5' end of eukgryotfc mRNA Is 
capped, /.a, contains the cap structure m 7 GpppN (7-methyi- 
g^rarraslne-triprx^ Most translation In 

eukaryotes occurs In a cap-dependent fashion, La, the cap is 
specifically recognized by the elF4E, 3 which binds the 5' cap. 
The eIRF translation initiation complex Is then formed by the 
assembly of elF4E, the RNA heficase elF4A, and elF4G, a 
scaffolding protein that mediates the binding of the 40S ribo- 
somai sufaunft to the mRNA molecule through Interaction with 
the e!F3 protein present on the 40S ribosoma elF4A and elF4B 
participate in melting the secondary structure of the 5' UTR of 
the mRNA. The 43S initiation complex (40S/elF2/Met-tRNA/ 
GTP complex) scans the mRNA in a 5'-*3' direction until it 
encounters an AUG start codon. This start codon Is then base- 
paired to the anticodon of initiator tRNA. forming the 48S Initi- 
ation complex. The Initiation factors are then displaced from the 
48S complex, and the 60S ribosome joins to form the 80S 
ribosome. 

Unlike most eukaryotic translation, translation Initiation of 
certain mRNAs, such as the picomavirus RNA, Is cap Inde- 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require elF4E Ether the 43S complex can bind 
the initiation codon directly through Interaction with the IRES in 
the 5' UTR such as in the encephalomyocanditis virus, or it can 
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Fg. 7. Translation initiation In eukaiyotes. The 4E-BP8 ere hyperpho3- 
pnoryiated to release eJF4E so that ft can Interact with the 5' cap, and the 
e*F4F initiation complex is assembled. The interaction of poly{A) binding 
protein with the initiation complex and circtttarization of the mRNA ts not 
depicted In the diagram. The socondaxy structure of the 5' UTR is melted, 
trie 40S rtbosoma) subunit la bound td eJF3, and the ternary complex 
consisting of elF2, OTP. and the Met-tRNA are recruited to the mRNA. The 
rtbo3omo scans the mRNA in a 5'-»3' direction until an AUG start codon 
Id found in the appropriate sequence context The initiation factors are 
released, and the large ribosomal subunft is recruitecL 



Initially attach to the IRES and then reach the Initiation codon by 
scanning or transfer, as Is the case with the poliovirus (1). 

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations in the 
expression or phosphorylation status of the various factors 
Involved. Key components In translations] regulation that 
may provide potential therapeutic targets follow. 

elF4E. elF4E plays a central role In translation regulation. 
It Is the least abundant of the initiation factors and is con- 
sidered the nate-ilmrtlng component for Initiation of cap- 
dependent translation. elF4E may also be Involved In mRNA 
splicing, mRNA 3' processing, and mRNA nutieocytopras- 
mic transport (2). elF4E expression can be Increased at the 
transcriptional level In response to serum or growth factors 
(3). elf=4E overexpresslon may cause preferential translation 
of mRNAs containing excessive secondary structure In their 
5' UTR that are normally discriminated against by the trans- 



lations! machinery and thus are inefficiently translated (4-7). 
As examples of this, overexpresslon of elF4E promotes In- 
creased translation of vascular endothelial growth factor, 
fibroblast growth fector-2, and cyciln D1 (2, 8, 9). 

Another mechanism of control is the regulation of elF4E 
phosphorylation. eiF4E phosphorylation is mediated by the 
mltogen-activated protein Wnase-interactlng kinase 1, which 
Is activated by the mrtogen-actlvated pathway activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 mltogen-activated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, Insulin, 
angiotensin II, src kinase overexpresslon, and has over- 
expression, lead to elF4E phosphorylation (14). The phos- 
phorylation status of 6IF4E Is usually correlated with the 
translations! rate and growth status of the cell; however; 
elF4E phosphorylation has also been observed In response 
to some cellular stresses when translations! rates actually 
decrease (15). Thus, further study Is needed to understand 
the effects of e!F4E phosphorylation on eIF4E activity. 

Another mechanism of regulation is the alteration of eIF4E 
availability by the binding of eIF4E to the elF4E~blndlng pro- 
teins (4E-BP, also known as PHAS-I). 4E-BPs compete with 
elF4G for a binding site In e1F4E The binding of e!F4E to the 
best characterized elF4E-bindlng protein, 4E-BP1, Is regu- 
lated by 4E-BP1 phosphorylation. Mypcphosphprylated 4E- 
BP1 binds to e!F4E, whereas 4E-BP1 hyperphosphc^ylation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyte 
growth factor, nerve growth factor, insulin-like growth factors 
I and II, InterteuWn 3, granulocyte-macrophage colony-stim- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to Induce phosphorylation of 4E-BP1 and 
to decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of nutrients or growth factors results 
In 4E-BP1 dephosphorylation, an increase in elF4E binding, 
and a decrease In cap-dependent translation. 

p70S6 Kinase. Phosphorylation of ribosomal AOS protein 
86 by S6K Is thought to play an important role In translation^ 
regulation. S6K -/- mouse embryonic ceils proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive influence on ceil proliferation (1 7). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
ofigopyrrrrrfolne tract TOP)foijndatti»5 f irmof rflx3Somal 
protein mRNAs and other mRNAs coding for components of 
the translations) machinery. Phosphorylation of S6K Is regu- 
fated in part based on the availability of niitrients(18,1^)andls 
stimulated by several growth factors, such as pfatetet-dertved 
growth factor and fnsulMke growth factor I £0). 

elF2« Phosphorylation. The binding of the Initiator tRNA 
to the small ribosomal unit Is mediated by translation initia- 
tion factor eJFZ Phosphorylation of the a-subunit of elF2 
prevents flirtation of the elF2/GTP/Mat-tRNA complex and 
Inhibits global protein synthesis (21, 22). eIF2a Is phospho- 
ryiated under a variety of conditions, such as viral infection, 
nutrient deprivation, heme deprivation, and apoptosis (22). 
etF2« Is phosptoryfated by herne-regulated Inhibitor, nutrient- 
regulated protein kinase, and the IFN -induced, double- 
stranded RNA-actJvated protein kinase (PKR; Ret. 23). 
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The mTOR Signaling Pathway. The macrollde antibiotic 
rapamycln {SiraHmus; Wyeth-Ayenst Research. CoDegevllle, 
PA) has been the subject of intensive study because it in- 
hibits signal transduction pathways Involved In T-cell activa- 
tion. The rapamycln-sensfttve component of these pathways 
Is mTOR (also called FRAP or RAFT1). mTOR is the mam- 
malian homofogue of the yeast TOR proteins that regulate Q 1 
progression and translation In response to nutrient availabil- 
ity (24). mTOR is a serine-threonine kinase that modulates 
translation Initiation by altering the phosphorylation status of 
4E-BP1 and S6K (Fig. 2; Ret 25). 

4E-BP1 Is phosphorylated on multiple residues. mTOR phos- 
phorates the Thr-37 and Thr-48 residues of 4E-BP1 in vhro 
(26); however, phosphorylation at these sites b not associated 
with a loss of e!F4E binding. Rtosphorytation of Thr-37 and 
Thr-46 is required for subsequent phosphorylation at several 
COOH-termlnal, serum-sensitive sites; a combination of these 
phosphorylation events appears to be needed to inhibit the 
binding of 4E-BP1 to elF4£ (25). The product of the ATM gene, 
P38/MSK1 pathway, and protein kinase Co- also play a rote fn 
4E-BP1 phosphorylation (27-29). 

S6K and 4E-BP1 are also regulated. In part, by P13K and Its 
downstream protein kinase Akt PTCN Is a phosphatase that 
negatively regulates PI3K signaling. PTEN nuQ cells have 
constltutrVely active of Akt, with increased S6K activity and 
S6 phosphorylation (30). S6K activity Is inhibited both by 
PI3K Inhibitors wortmannin and LY294002 and by mTOR 
Inhibitor rapamycln (24). Akt phosphorylates Ser-2448 in 
mTOR in vltro t and this site Is phosphorylated upon Akt 
activation in vivo (31-33). Thus, mTOR Is regulated by the 
POK/Akt pathway; however, this does hot appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR la controversial. 

Interestingly, mTOR autophosphoiyiatfon Is blocked by wort- 
mannin but not by rapamycln (34). This seeming Inconsistency 
suggests that mTOR-responslve regulation of 4E-BP1 and S8K 
activity occurs through a mechanism other than intrinsic mTOR 
kinase activity. An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activity is by the inhibition of a phospha- 
tase. Treatment with calyculrn A, an inhibitor of phosphatases 1 
and 2A, reduces rapamydn-induoed dephosphorylation of 4E- 
BP1 and S6K by rapamycln (35). PP2A Interacts with full-length 
S6K but not with a S6K mutant that is resistant to dephospho- 
rylation resulting from rapamycln. mTOR phosphorylates PP2A 
in Wfro; however, how this process alters PP2A actrvtty is not 
known. These results are consistent with the model that phos- 
phorylation of a phosphatase by mTOR prevents dephospho- 
rylation of 4E-BP1 and S6K, and conversely, that nutrient dep- 
rivation and rapamycln block Inhibition of the phosphatase by 
mTOR. 

Poryadenyiation. The pofy{A) tail In eukaryotic mRNA Is 
important In enhancing translation Initiation and mRNA sta- 
bility. Poryadenyiation plays a key rote in regulating gene 
expression during oogenesis and early embryogenesls. 
Some mRNA that are translation ally inactive in the oocyte are 
potyadenylated concomitantly with translationaJ activation In 
oocyte maturation, whereas other mRNAs that are trartsla- 
tlonaify active during oogenesis are deadenylated and trans- 
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lationaliy silenced (38-38). Thus, control of po!y(A) tall syn- 
thesis Is an Important regulatory step in gene expression. 
The 5' cap and poly(A) tall are thought to function synergfe- 
tically to regulate mRNA translationaJ efficiency (39, 40). 

RNA Packaging. Most RNA-Wndlng proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translationaJ fate of the transcript (41). A highly 
conserved family of Y-box proteins is found In cytoplasmic 
messenger ribonucleoproteln particles, where the proteins 
are thought to play a role In restricting the recruitment of 
mRNA to the translatlonal machinery (41-43), The major 
mRNA-associated protein, YB~1 , destabilizes the interaction 
of el F4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results In translatlonal repression In vfvo (44). Thus, 
alterations in RNA packaging can also play an important role 
In translationaJ regulation. 

Translation Alterations Encountered in Cancer 

Three main alterations at the translatfonal level occur In cancer 
variations in mRNA sequences that increase or decrease trans- 
lationaJ efficiency, changes In the expression or avaSabHity of 
components of the translationaJ machinery, and activation of 
translation through aberrantly activated signal transduction 
pathways. The first alteration affects the translation of an Indi- 
vidual mRNA that may play a role in carcinogenesis. The sec* 
ond and third alterations can lead to more global changes, such 
as an increase In the overafi rate of protein synthesis, and the 
translationaJ activation of several mRNA species. 

Variations In mRNA Sequence 
Variations In mRNA sequence affect the transactional effi- 
ciency of the transcript A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations In the mRNA sequence, especially 
in the 5' UTR, can alter Its translatlonal efficiency, as seen In 
the following examples. 
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c-myc Salto eta!, proposed that translation of fulHength 
o-myc Is repressed, whereas In several BurWtt lymphomas 
that have deletions of the mRNA 5' UTR, translation of omyc 
Is more efficient (45). More recently, it was reported that the 
S ( LFTR of omyc contains an IRES, and thus omyc transla- 
tion can be initiated by a cap-Independent as well as a 
cap-dependent mechanism (46, 47). In patients with multiple 
myeloma, a C-+T mutation in the omyc IRES was Identified 
(48) and found to cause an enhanced initiation of translation 
via internal ribosomal entry (48). 
. BRCA1. A somatic point mutation (117 G-»C) in position 
-3 with respect to the start codon of the BRCA 1 gene was 
identified In a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRCA1 5' UTR and a downstream luctferase reporter dem- 
onstrated a decrease in the translations efficiency with the 5' 
UTR nutation. 

Cycfln-dopendent Kinase Inhibitor 2A. Some Inherited 
melanoma kindreds have a G-*T transversion at base -34 
of cycQrvdependent kinase !nhibftor-2A, which encodes a 
cyciln-dependent kinase 4/cydin-dependent kinase 6 kinase 
Inhibitor Important In G, checkpoint regulation (51). This 
mutation,, gives rise to a novel AUG translation Initiation 
codon. creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations In splicing and alternate transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ay stnjctiire, ultimate Impact^ 

ATM. The ATM gene has four noncoding exons In its 5' 
UTR that undergo extensive alternative splicing (52). The 
contents of 12 different 5' UTRs that show considerable 
diversity In length and sequence have been identified. These 
cflvergent 5' leader sequences ptay an Important role In the 
translations! regulation of the ATM gene. 

mdnu In a subset of tumors, overexpression of the onco- 
protein mdm2 results In enhanced translation of the mdm2 
mRNA. Use of different promoters leads to two mdm2 tran- 
scripts that differ only In their 5' leaders (53). The longer 5' 
UTR contains two upstream open reading frames, and this 
mRNA is loaded with ribosomes inefficiently compared with 
the short 5 r UTR. 

BRCA1. In a normal mammary gland, BRCA1 mRNA Is 
expressed with a shorter leader sequence (5'UTRa), whereas 
in sporadic breast cancer tissue, BRCA1 mRNA Is expressed 
with a longer leader sequence (5 r UTRb); the translation^ 
efficacy of transcripts containing 5' UTRb Is 10 times lower 
than that of transcripts containing 5' UTRa (54). 

TGF-pZ TGF-03 mRNA includes a 1.1 -kb 5' UTR, which 
exerts an inhibitory effect on translation. Many human breast 
cancer cell Rnes contain a novel 7GF-03 transcript with a 5' 
UTR that Is 870 nucleotides shorter and has a 7-fold greater 
transitional efficiency than the normal TGF-03 mRNA (55). 

Alternate Polyadenyiatkm Sites. Multiple polyadenyl- 
ation signals leading to the generation of several transcripts 
with differing 3' UTR have been described for several mRNA 
species, such as the RET proto-oncogene (56), ATM gene 
(52), tissue inhibitor of metalloprotelnases-3 (57), RHOA 



proto-oncogene (58), and caJmodulln-1 (59). Although the 
effect of these alternate 3' UTRs on translation Is not yet 
known, they may be Important In RNA-protein Interactions 
that affect translatidnal recruitment The role of these alter- 
ations In cancer development and progression is unknown. 



Alterations In the Components of the 
Translation Machinery 

Alterations In the components of translation machinery can 
take many forms. 

Overexpresssion of elF4E. Overexpression of e!F4E 
causes maBgnant transformation In rodent cells (60) and the 
deregulation of HeLa ceil growth (61). Pofunovsky et aL (62) 
found that elF4£ overexpression substitutes for serum and 
individual growth factors in preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival signaling. 

Elevated levels of elF4E mRNA have been found in a broad 
spectrum of transformed ceO lines (63). elF4E levels are 
elevated in all ductal carcinoma In situ specimens and inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpression is attributable 
to gene amplification (66). 

There are accumulating data suggesting that efF4E overex- 
pression can be valuable as a prognostic marker, el F4E over- 
expression was found in a retrospective study to be a rnarker of 
poor prognosis in stages I to III breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer Is now 
under way in a prospective trial (67). However, In a different 
study, etF4E expression was correlated with the aggressive 
behavior of non-HodgWn's lymphomas (68). in a prospective 
analysis of patients whh heed and neck cancer, elevated levels 
of elF4E in histologically tumor-Jree surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results an suggest that elF4E overexpression can be 
used to select patients who might herefrom more aggressive 
systemic therapy. Further 

suggest that e!F4E overexpression Is afield defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations in a 
number of other initiation factors have been associated with 
cancer. Overproduction of eJF4G, similar to elF4E, leads to 
malignant transformation In vitro (69). elF-2a is found In 
increased levels in bronchioloalveolar carcinomas of the lung 
(3). Initiation factor elF-4A1 is overexpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunlt of 
translation initiation factor 3 is amplified and overexpressed 
In breast and prostate cancer (72), and the elF3-p1 10 subunlt 
Is overexpressed in testicular seminoma (73). The role that 
overexpression of these Initiation factors plays on the devel- 
opment and progression of cancer, If any, is not known. 

Overexpression of S6K. S6K Is amplified and highly 
overexpressed In the MCF7 breast cancer eel) line, com- 
pared with normal mammary epithelium (74). In a study by 
Bariund et a/. (74), S6K was amplified in 59 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 



Overexpresslon of PAP. PAP catalyzes 3' po!y{A) syn- 
thesis. PAP Is overaxpressed In human cancer cells com- 
pared with normal and vfrafly transformed cells (75). PAP 
enzymatic activity In breast tumors has been correlated with 
PAP protein levels (76) and t in mammary tumor cytosote, was 
found to be an Independent factor for predicting survival (76). 
Uttle Is known, however, about how PAP expression or ac- 
tivity affects the translations! profile. 

Alterations In RNA-blndtng Proteins. Even less te known 
about alterations In RNA packaging In cancer. Increased ex- 
pression and nuclear localization of the RNA-blndlng protein 
YB-1 are Indteatore of a poor prognosis for breast cancer (77), 
rcxvsmal ce8 lung cancer (78), and ovarian cancer (79). How- 
ever, this effect may be mediated at least In part a! the level of 
transcription, because YB-1 Increases diemoresfetence by en- 
hancing the transcription of a muttttug resistance gene (80). 



Activation of Signal Transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpresslon of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
Important .mutant In human cancers Is the tumor suppressor 
gene PTBV, which leads to the actlvatk)noftr»PI3K/AktpaMv 
way. Activation of PI3K and Akt Induces the oncogenic trans- 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of 4E-BP1 (81). 
A mutant Akt that retains kinase activity but does not phos- 
phorytate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of PI3K and 
Akt and the ptospfaxylatton of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin-like growth footer, HER2/heu, and epidermal 
growth factor receptor are overexpressed In cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to after translation Initiation, activation 
of translation Is likely to contribute to the growth and aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translations control. 
For example, HER2/neu mRNA Is tmnslationally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a ceil type-Independent manner and 
by a distinct cell type-dependent mechanism that Increases 
translation al efficiency (82). HER2/neu translation Is different 
in transformed and normal cells. Thus, It Is possible that 
alterations at the translations! level can In part account for 
the discrepancy between HEB2/neu gene amplification de- 
tected by fluorescence In situ hybridization and protein levels 
detected by immunohlstochemlcal assays. 



Translation Targets . of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways Involved In the activation of translation Initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycln 
and Tumstatin 

Rapamycln Inhibits the proliferation of lymphocytes. It was 
Initially developed as an Immunosuppressive drug for organ 



transplantation. Rapamycln with FKBP 12 (FK50B-blndlng 
protein, M T 12,000) binds to mTOR to Inhibit fts function. 

Rapamydn causes a email but significant reduction in the 
Initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S8 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycln also 
blocks 4E-BP1 phosphorylation and Inhibits cap-dependent 
but not cap-Independent translation (17, 86). 

The rapamycivsensffive signal transduction pathway, acti- 
vated during malignant transformation and cancer progression, 
is row being studied as a target for canra 
tale, breast, 6mafl ceil lung, glioblastoma, melanoma, and T-cefl 
leukemia are among the cancer fines most sensitive to the 
rapamydn analogue CCt-779 (WyetiVAyerst Research; Ref. 
87). In rhabdomyosarcoma ceO lines, rapamydn te either cyto- 
static or cytoddal. depending on the p53 status of the ceHp53 
wfld^ype cells treated with rapamydn arrest In the 0, phase 
and maintain their viabOrty, whereas p53 mutant cefls accumu- 
late In ^ and urriergoapoptosis(88» 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
medulloblastoma models, rapamycln exhibited more cytotox- 
icity In c»mbination with ctsplatin and camptothedn than as a 
single agent In vivo, CCI-779 delayed growth of xenografts by 
160% after 1 week of therapy and 240% after2 weeks. A single 
high-dose administration caused a 37% decrease in tumor 
volume. Growth I nhibitio n h vivo was 1-3 tones greater, with 
dspiatin In combination with C0779 than wtth cispJatin alone 

(90) . Thus, preclinical studies suggest that rapamydn ana- 
logues are useful as single agents and in combination with 
chemotherapy. 

Rapamycln analogues CCI-779 and RADOOt {Novartls, 
Basel, Switzerland) are now In clinical trials. Because of the 
known effect of rapamydn on lymphocyte proliferation, a 
potential problem with rapamydn analogues Is immunosup- 
pression. However, although prolonged Immunosuppression 
can result from rapamydn and CCI-779 administered on 
continuous-dose schedules, the Immunosuppressive effects 
of rapamydn analogues resolve In -24 h after therapy 

(91) . The principal toxicities of CCI-779 have induded der- 
matologies! toxicity, myetosuppresskm, Infection, mucositis, 
diarrhea, reversible elevations In liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCI-779 have been conducted in 
advanced renal celt carcinoma and In stage lll/IV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported In abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94, 95). 
Thus, CCI-779 has documented preliminary di nteal activity in 
a previously treated, unselected patient population. 

Active Investigation is under way Into patient selection lor 
mTOR Inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 In PTEN-nutl tumors (30, 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCI-779, although only two of these lines 
lacked PTEN 07) There was, however, a positive correlation 
between Akt activation and CCI-779 eensftMty (97). This 
correlation suggests that activation of the POK-Akt pathway, 



97G Tranriatfen Infttatton In Cancer 



regardless of whether It te attributable to a PTTEN mutation or 
to overexpression of receptor tyrosine kinases, makes can- 
cer celt amenable to mTOR-directed therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycln resistance; thus, a lower 4E-BP1/elF4E ratio 
may predict rapamycln resistance (98). 

Another mode of activity for rapamycln and Its analogues 
appears to be through Inhibition of anglogenesfe. This activ- 
ity may be both through direct inhibition of endothelial ceil 
proliferation as a result of mTOR inhibition in these cells or by 
Inhibition of translation of such proangfogenlc factors as 
vascular endothelial growth factor In tumor cells (99, 100). 

The anglogenesis inhibitor tumstatln, another anticancer 
drug currently under study, was also found recently to Inhibit 
translation In endothelial cells (101). Through a requisite In- 
teraction with Integrin, tumstatln inhibits activation of the 
Pf3K/Akt pathway and mTOR in endothelial ceils and pre- 
vents dissociation of eiF4E from 4E-BP1. thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-signallng pathway. 



Targeting elF2cc EPA, Clotrimazole, mda-7, 
and Flavonolds 

EPA Is an n-3 polyunsaturated fatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as In animal models (104, 1 05). it blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2+ from 
Intracellular stores while Inhibiting their refining, thereby ac- 
tivating PKR PKR, in turn phosphorylates and inhibits elF2a, 
resulting In the Inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent In vftro and In vivo, inhibits ceil growth through 
depletion of Ca*+ stores, activation of PKR, and phospho- 
rylation of eIF2a (108). Consequently, clotrimazole preferen- 
tially decreases the expression of cydins A, E, and D1, 
resulting In blockage of the cell cycle in Q v 

mda-7 Is a novel tumor suppressor gene being developed 
as a gene therapy agent Adenoviral transfer of mda-7 (Ad- 
mda7) Induces apoptosls in many cancer cells including 
breast, colorectal, and lung cancer (1 07-1 09). Ad-mda7 also 
Induces and activates PKR, which leads to phosphorylation 
of elF2a and induction of apoptosls (110). 

Flavonolds such as genlstein and quercetin suppress tu- 
mor cell growth. All three mammalian elF2a kinases, PKR, 
heme-regulated Inhibitor, and PERK/PEK, are activated by 
flavonolds, with phosphorylation of elF2a and inhibition of 
protein synthesis (111). 



Targeting elF4A and elF4E: Antisense RNA 
and Peptides 

Antisense expression of eiF4A decreases the proliferation rate 
of melanoma ceils (1 1?). Sequestration of elRE by overexpres- 
sion of 4E-BP1 is proapoptotfc and decreases tumorigenicJty 
(113, 114). Reduction of eiF4E with antisense RNA decreases 
soft agar growth, Increases tumor latency, and Increases the 
rates of tu mor doubling times (7). Antisense elF4E RNA treat- 



ment also reduces the expression of angiogenic factors (115) 
and has been proposed as a potential adjuvant therapy for head 
and neck cancers, particularly when elevated elF4€ Is found In 
surgical mangins. Small molecuie Inhibitors that bind the eIF4G/ 
4E-BP1 -binding domain of e(F4E are proapoptotfc (116) and 
are also being actively pursued. 



Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
Of gene therapy vectors encoding suicide genes with highly 
structured 5' UTR These mRNA would thus be at a competitive 
disadvantage in normal cells and not translate well, whereas in 
cancer cells, they would translate more efficiently. For example, 
the introduction of the 5' UTR of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex virus type-1 thymidine 
kinase gene, allows for selective translation of herpes simplex 
virus type-1 thymidine kinase gene In breast cancer cell lines 
compared with normal mammary cefl lines and results In se- 
lective sensmvtty to ganciclovir (1 1 7). 



Toward the Future 

Translation Is a crucial process In every celL However, several 
alterations In translational control occur in cancer. Cancer eel Is 
appear to need an aberrantly activated translational state for 
survival, thus allowing the targeting of translation initiation wfth 
surprisingly low toxicity. Components of the translational ma- 
chinery, such as eJF4E, and signal transduction pathways fcrv 
vofved in translation Initiation, such mTOR, represent promising 
targets for cancer therapy. Inhibitors of tire mTOR have already 
shown some preliminary activity in clinical trials. It Is possible 
that with the development of better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR Inhibitors are most (feely to achieve cfrtlcaJ utility in 
combination therapy. In the interim, our increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to toad to the identification of new therapeutic targets 
in the near future. 
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Correlation between Protein and mRNA Abundance in Yeast 

STEVEN P. GYGI, WAN ROCHON, B. ROBERT FRANZA, and RUEDI AEBERSOLD* 
Department of Molecular Biotechnology, University of Washington, Seattle, Washington 98195-7730 
Received 5 October 1998/Retumed for modification 11 November 1998/Accepted 2 December 1998 

We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-tog phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chroraatograpny-tandem mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient. 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
display-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
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(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by rnicrospray liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same met- 
abolic state. The resultant correlation is insufficient for predic- 
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FIG. I. Schematic illustration of protcomc analysis by 2DE and mass spectrometry. In part I, proteins arc separated by 2DE, stained spots arc excised and subjected 
to in-gcl digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performance liquid chromatography. In part II. a peptide is shown 
cluting from the column in part I. The peptide is ionized by clcctrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass filter allows only the specific mass-to-chargc ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of differing lengths (masses). These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadrupole mass filter where two ion 
scries arc recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides arc identified as being derived from the same protein. 



tion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell Iysates and that therefore the 
construction of complete proteome maps with current technol- 
ogy will be very challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND METHODS 

Yeast strain and growth conditions. The source of protein and message tran- 
scripts for all experiments was YPH499 {MAT* um3-52 tys2-801 ade2-101 
ku2-LI his3-t200trpl-L63) (30). Logarithmically growing cells were obtained by 
growing yeast cells to early log phase (3 X 10 6 cells/ml) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) 
at 30"C (35). Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Garrcls and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80°C 

2DE. Soluble proteins were run in the first dimension by using a commercial 
flatbed electrophoresis system (Multiphor II; Pharmacia Biotech). Immobilized 
polyacrylamidc gel (IPG) dry strips with nonlinear pH 3.0 to 10.0 gradients 
(Am ersham-Ph armaria Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell tysatcs was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nontdet P-40, 10 mM dithiothrcitol), and 250 
to 380 pJ of solution was added lo individual lanes of an IPG strip rehydration 
tray (Amersham-Pharmacia Biotech). The strips were allowed to rchydratc at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 kV • 
h), the IPG strips were reequilibratcd for 8 min in 2% (wt/vol) dithiothrcitol in 
2% (wt/vol) SDS-6 M urca-30% (wt/vol) glyceroM>.05 M Tris HQ (pH 6.8) and 
for 4 min in 2.5% iodoacctamidc in 2% (wt*ol) SDS-6 M urca-30% (wt/vol) 
glyccrol-0.05 M Tris HCl (pH 6.8). Following ^equilibration, the strips were 
transferred and apposed to 10% polyacrytanudc second-dimension gels. Poly- 
acrylamidc gels were poured in a casting stand with 10% acrylamidc-2.67% 
pipcrazinc diacrylamidc-0.375 M Tris base-HQ (pH 6\8)-0.1% (wtfvol) SDS-<105% 
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FIG. 2. 2D silver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the enure range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometry and database searching. The spots arc labeled on the 
gel and correspond to the data presented in Table 1. Molecular weights are given in thousands. 



(wttort) ammonium persulfatc-O.05% TEMED (A/^^'^-tetramethylcthyl- 
encdiaminc) in Milli-Q water. The apparatus used to run second-dimension gels 
was a noncommercial apparatus from Oxford Glycosctcnccs, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
SO mA (constant)-500 V-85 W for 20 min, followed by 200 mA (constant)-500 
V-85 W until the buffer front line was 10 to IS mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shcvchcnko 
ct al (29). 

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used to excise 1S6 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detection limit by mass spectrometry, spots were cut out and pooled from up to 
four identical cold, silver-stained gels. In-gel tryptic digests of pooled spots were 
performed as described previously (29). Tryptic peptides were analyzed by mi* 
crocapiUary LC-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30.2; 250,514 protein sequences) (24a) by using the 
computer program Sequcst (8), which matches theoretical and acquired tandem 
mass spectra. A protein match was determined by comparing the number of 
peptides identified and their respective cross-correlation scores. All protein 
identifications were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Vclculescu and coworkers have previously generated 
frequency tables for yeast mRNA transcripts from the same strain grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based on two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated from each mRNA transcript in the cell which corresponds to 
15 bp at the 3 '-most cutting site for NIalU. Second, many transcript tags can be 
concatenated into a single molecule and then sequenced, revealing the identity of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YPH499 growing at mid-log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules per cell (16), this would represent 
a 1.3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of detecting such transcripts. Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, and performed the calculation as described by Vclculescu and coworkers 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(0 no CATG site was found in the open reading frame (ORF), (ii) a CATG site 
was found but the corresponding 10-bp putative SAGE tag was not found in the 
frequency tables, or (iii) identical putative SAGE tags were present for multiple 
genes (eg., TDH2_YEAST and TDH3_YEAST). 
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TABLE 1. Expressed genes identified from 2D gel in Fig. 2 TABLE 1— Continued 



Molwt 


Pi 


Spot no. 


YPD gene 
name" 


Protein 
abundance 
(10* copies/ 
cell) 


mRNA 
abundance 
I copies/ cell) 


Codon 
bias 


Mo! wt 


pi 


Spot no. 


YPD gene 
name" 


Protein 
abundance 
(10* copies/ 
cell) 


mRNA 

aDunaancc 


Codon 
bias 


17,259 


6.75 


133 


CPR1 


15.2 


61.7 


0.769 


39,477 


538 


86 


roAl 


17 D 
1 /.o 


1113 ft 
105.0 


ft 03C 
U.7J5 


18,702 


4.80 


83 


EGD2 


20.1 


5.2 


0.724 


39,477 


538 


87 


FBAl 


X77 7 

427.2 


TK3 ft 
103.0 


ft 03C 


18,726 


4.44 


147 


YKL056C 


61.2 


88.4 


0.831 


39,540 


c cn 
630 


1 CA 

150 


HUM2 


OU.J 


A 

4.J 


ft CQ7 


18,978 


5.95 


135 


YER067W 


3.7 


6.7 


0.118 


39,561 


6.12 


156 


PSAl 


OC A 

90.4 


77 C 
2/.3 


0.718 


19,108 


5.04 


130 


YLR109W 


94.4 


9.7 


0.680 


41,158 


c ni 
6.01 


AO 

49 


YNL134U 


1 A O 

i4.y 


1 c 
I, J 


ft lift 


19,681 


9.08 


136 


ATP7 


11.0 


NA 6 * 


0246 


41,623 
41,728 


7.18 


CO 

58 


BAT2 


ion 
19.0 


0 0 

0.7 


ft 7Cft 


20,505 


6.07 


111 


GUK1 


16.5 


3.7 


0.422 


7.29 


11ft 
110 


T7T5 /""» 1 A 

EKG10 


7vi 1 

24.1 


A C 
4.5 


ft CAT. 
U.545 


21,444 


5.25 


148 


SARI 


5.4 


10.4 


0.455 


aa An a 
41,900 

42,402 


C A*\ 

5.42 


1A 

74 


TUM4U 


77 7 

22.3 


7 7 
4*2 


ft 17C 
UJ /5 


21,583 


4.98 


95 


TSA1 


110.6 


40.1 


0.845 


6.29 


AC 

45 


CYo3 


C 7 

0./ 


ft a 


ft ft71 


22,602 


430 


80 


EFB1 


66.1 


23.8 


0.875 


42,883 


5.63 


67 


DYS1 


ICQ 

15.o 


C 7 
5.2 


ft <7ft 


23,079 


6.29 


112 


SOD2 


1Z6 


2.2 


0.351 


43,409 


6.31 


107 


CCD 1 


in c 
10.5 


1 c 

1.5 


ft 70/7 


23,743 


5.44 


137 


HSP26 


NA rf 


0.7 


0.434 


A1 A*it 

43,421 


C Cft 

5.59 


91 


■ctj nc 


7 7 

2.2 


1A 1 
14.1 


ft AftR 


24,033 


5.97 


96 


ADK1 


17.4 


16.4 


0.656 


A A \1A 

44,174 


7.32 


cc 
5o 


YBKU25U 


15 1 

13.1 


ft n 


ft ftftA 


24 f 058 


4.43 


143 


YKL117W 


29.2 


10.4 


0.339 


44,682 


4.99 


72 


llrl 


7 O 

2.7 


30 A 
37.4 


ft Q.1A 


24,353 


6.30 


140 


TFS1 


8.1 


0.7 


0.146 


A A 7fl7 

44,7U7 


7 77 

7.77 


1 no 
1U8 


rvjrlVl 


7*3 7 


1ft*\ 7 
105./ 


ft R07 
U.o7 / 


24,662 


5.85 


99 


URA5 


25.4 


6.0 


a ico 
U.359 


A A 7H7 

44, /U/ 


7 77 


mo 
1U7 


Jrvtt.1 




IftS 7 
JO J. / 


ft H07 
W.07 / 


24,808 


633 


97 


GSP1 


26.3 


5.2 


n TIC 

0.735 


ac non 
4o,UoO 


C 77 

0.72 


3U 




1 C. A 




ft AOS 


24,908 


8.73 


122 


RPS5 


18.6 


NA C 


0.899 


463o3 


O C7 

832 


c3 

53 


TTMM 


7 7 

/./ 


n 7 
u./ 


ft A3ft 


25,081 


4.65 


81 


MRP8 


9.3 


NA C 


A nAt 

0.241 


45,553 


C QO 

5.98 


47 


lurZ 


37 A 


MAC 
fN/\ 


ft 1Q7 


25,960 


6.06 


116 


RPE1 


5.8 


0.7 


0372 


ac cno 
46,0/9 


c io 
6.39 


cn 
50 


cvni 
JbNUl 


1C A 
35.4 


A 7 
U./ 


ft 03ft 

U.7JU 


26,378 


9.55 


127 


RPS3 


96.8 


NA C 


0.863 


ac cnt\ 
40,0/9 


c in 
0.39 


CI 

51 


cfSUl 


0.0 


n 7 
o./ 


U.7JU 


26,467 


5.18 


100 


VMA4 


10.5 


3.7 


0.427 


46,679 


6.39 


52 


cNUl 


7 7 
2.2 


n 7 

u. / 


ft 03ft 


26,661 


5.84 


98 


TPI1 


NA rf 


NA e 


0.900 


46,773 


5.82 


63 


cNUZ 


ICC 

153 


700 1 
207.1 


n o^n 


27,156 


5.56 


93 


PRE8 


6.9 


0.7 


0.129 


46,773 


5.82 


CA 

54 


JbNU2 


JC7C c 

035 O 


7&0 1 
20 7.1 


n Oftft 
u.you 


27,334 


6.13 


115 


YHR049W 


18.4 


2.2 


0320 


46,773 


5.82 


65 


TJKTA1 

fcJM(J2 


07 n 
93.0 


7ftO 1 

207. 1 


ft Oftft 

u.you 


27,472 


533 


92 


YNL010W 


31.6 


3.7 


0.421 


46,773 


5,82 


66 


EN02 


71 n 
31.0 


7QO 1 
207.1 


ft Oftft 

u.you 


27,480 


8.95 


123 


GPM1 


10.0 


169.4 


0.902 


47,402 


6.09 


126 


CUK1 


7 C 

23 


n 7 
u. / 


ft A77 


27,480 


8.95 


124 


GPM1 


231.4 


169.4 


0.902 


47,666 


8.98 


CA 

54 


A ATI 

AA1Z 


117 
11./ 


ft n 


ft 33.ft 


27,480 


8.95 


125 


GPM1 


7.5 


169.4 


0.902 


48,364 


5.25 


73 


WTM1 


7i4 C 

743 


1 3 A 
13.4 


n iac 
UO05 


27,809 


5.97 


139 


HOR2 


5.7 


0.7 


0381 


48,530 


6.20 


61 


MET17 


3H.1 


7D n 
27.0 


n C7ii 
U-5 /0 


27,874 


4.46 


78 


YST1 


13.6 


52.8 


0.805 


48,904 


5.18 


cn 

69 


t ven 


1 JC 7 

lo.2 


3 7 
D.I 


ft Aft3 
U.403 


28,595 


4.51 


41 


PUP2 


4.4 


0.7 


0.147 


48,987 


a on 
4.90 


1 CI 

153 


CT 1D/1C 

oUr45 


70 A. 
2y.0 


110 

1 J. 7 


ft 7,77 
U.j/ / 


29,156 


6.59 


114 


YMR226C 


14,5 


2.2 


0.283 


Afi TT7 

49,727 


C A1 

5 .47 


7ft 

70 


DT>/"\7 

FRU2 


17 A 
13.0 


C 7 
5.Z 


ft 707 

u.zy / 


29,244 


8.40 


120 


DPMI 


5.0 


11.2 


0362 


At\ rn i 

49,912 


9.27 


c*> 
o2 


TEF2 


CCQ C 


7ft7 fl 


ft 037 


29,443 


5.91 


48 


PRE4 


3.4 


3.7 


0.162 


50,444 


C iC7 

5.07 


35 


vt\d 1 orw^ 


4.o 


7 7 


ft 70ft 


30,012 


6.39 


138 


PRB1 


21.2 


13 


0.449 


50,837 


6.11 


32 


YlilJU4/U 


3.0 


1 c 

1.5 


ft Tft7 
U.50/ 


30,073 


4.63 


77 


BMH1 


14.7 


28.2 


0.454 


50,891 


A CO 

439 


1 C1 

151 


T7 TD.7 

I Uo2 


117 
11.2 


7 d 
/.4 


ft AftA 
U.*H/H 


30,296 


7.94 


121 


OMP2 


67.4 


41.6 


0.499 


51,547 


c on 
o.oO 


27 


LP Dl 


1 Q O 

lo.V 


7 7 


ft "3C1 


30,435 


6.34 


89 


GPP1 


70.2 


11.2 


0.703 


52,216 


7.25 


7ft 

29 


orlM2 


ID 7 


7 A 
/.4 


0 777 


31,332 


5.57 


88 


ILV6 


13.9 


3.0 


0.402 


52,859 


C CA 

534 


37 


YrK044Lr 


7rt 7 


ft 7 
0. / 


ft AA7 
\l. f ¥fc 


32,159 


5.46 


113 


1PP1 


63.1 


3.7 


0.752 


53,798 


C 1ft 

5.19 


71 


TTW7 

HXK2 


7<; c 
20.5 


7 A 
/.4 


ft 7Sft 
U./50 


32,263 


6.00 


149 


HIS1 


22.4 


4.5 


0.232 


ci oni 


c nc 
O.U5 


1 AC 

145 


UTro 


A A 
4.4 


fl 7 
w. / 


ft 127 


33,311 


5.35 


84 


SPE3 


15.1 


6.7 


0.468 


54,403 


C 7ft 

5.29 


3ft 


at t\a 


7.7 7 


7 7 


ft ftftA 
U.OO** 


34,465 


5.60 


129 


ADE1 


8.7 


5.2 


A 2 AC 

0.305 


CA AM 

54,403 


C 70 

5.2y 


Aft 

40 


AJLUO 


0.0 




n AAA 


34,762 


5.32 


85 


SEC14 


10.9 


6.0 


0.373 


CA CM 

54,502 


C 7ft 

0.20 


71 
31 


AUCJ5 


ft 3 
O.J 


1 C 

l.J 


ft A17 


34,797 


5.85 


42 


URA1 


49.5 


8.9 


0.237 


54,543 


^ 7C 

7.75 


25 


BVV1 


77C 3 
225.3 


ini ft 


ft OftS 


34,799 


6.04 


90 


BELl 


103.2 


81.0 


0.875 


CA CA-y 

54,543 


7 7C 

7.75 


7K 
20 


DW1 
r Tlvl 


7Q S 


mi ft 


ft Qft< 


35,556 


5.97 


43 


YDL124W 


6.4 


4.5 


0.206 


55,221 


c cc 
O.OO 


1 AC 

146 


VET A71U/ 


1 ft 3 
10.3 


3 ft 


ft 7AA 
U.Z44 


35,619 


8.41 


59 


TDH1 


69.8 


32.T 


0.940 


55,295 


435 


1 1A 

134 


rUll 


CC 7 

00.2 


14 1 
14.1 


ft CftO 

uooy 


35,650 


5.49 


68 


CAR1 


5.2 


3.0 


0.339 


55364 


5.98 


7il 

24 


OLK1 


77 A 

22.0 


ft ft 
O.U 


ft 737 
U.Z5/ 


35,712 


6.72 


117 


TDH2 


49.6 


473.0 s 


0.982 


55,481 


7.97 


1 1 o 

Ho 


Alrl 


71 ft 
Z1.0 


7 7 


ft ftT.7 


35,712 


6.72 


154 


TDH2 


863.5 


473.0 s 


0.982 


55,886 


6.47 


28 


CYi4 


77 7 

22.2 




ft AAA 
U.444 


3C 71 7 


ft 77 
O./Z 


1 

ijj 




70 4 

/7.*T 


A73 ff 


0 082 


56 167 


5.83 


33 


AR08 


143 


3.0 


0324 


36,272 


4.85 


128 


APA1 


8.7 


0.7 


0.425 


56^167 


5^83 


34 


AR08 


9.1 


3.0 


0324 


36,358 


5.05 


75 


YJR105W 


17.6 


17.1 


0.522 


56,584 


636 


20 


CYB2 


18.9 


NA C 


0.259 


36,358 


5.05 


76 


YJR105W 


27.5 


17.1 


0.522 


57,366 


5.53 


60 


FRS2 


23 


0.7 


0.451 


36,596 


6.37 


79 


ADH2 


58.9 


260.CT 


0.711 


57,383 


5.98 


144 


ZWF1 


5.6 


0.7 


0.215 


36,714 


6.30 


102 


ADH1 


746.1 


260.0 


0.913 


57,464 


5.49 


36 


THR4 


21.4 


3.7 


0.508 


36,714 


6.30 


103 


ADH1 


17.6 


260.0 


0.913 


57,512 


5.50 


7 


SRV2 


63 


NA C 


0.260 


36,714 


6.30 


104 


ADH1 


61.4 


260.0 


0.913 


57,727 


4.92 


152 


VMA2 


33.7 


8.9 


0.546 


36,714 


6.30 


105 


ADH1 


52.7 


260.0 


0.913 


58,573 


6.47 


17 


ACH1 


4.4 


13 


0327 


37,033 


6.23 


44 


TALI 


44.8 


3.7 


0.701 


58^73 


6.47 


18 


ACH1 


5.4 


1.5 


0327 


37,796 


7.36 


57 


IDH2 


29.4 


6.7 


0330 


61353 


5.87 


21 


PDC1 


6.5 


200.7 


0.962 


37,886 


6.49 


106 


ILV5 


76.0 


4.5 


0.892 


61,353 


5.87 


22 


PDC1 


303.2 


200.7 


0.962 


38,700 


7.83 


55 


BAT1 


30.9 


11.2 


0.469 


61,353 


5.87 


23 


PDC1 


16.3 


200.7 


0.962 


38,702 


6.24 


46 


QCR2 


NA d 


2.2 


0.326 


61,649 


534 


38 


CCT8 


2.2 


1.5 


0.271 



Continued 



Continued on following page 



1724 GYGI ET AL. 



Mou Cell. Biol. 



TABLE \— Continued 



Mol wt 


Pi 


Spot no. 


i ivj gene 


Protein 


mRNA 


Codon 


name" 


(10 3 copies/ 
cell) 


abundance 
(copies/cell) 


bias 


61,902 


6.21 
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0.892 


69,313 


4.90 


14 


SSA2 


77.1 


18.6 
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6.11 
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94 
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4.8 


5.2 
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533 e 


4 


MCM3 
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0.240 



• YPD gene names arc available from the YPD website (39). 

* NA, calculation could not be performed or was not available. 
9 mRNA data inconclusive or NA. 

- No methionines in predicted ORF; therefore, protein concentration was not 
determined. 

' Measured molecular weight or pi did not match theoretical molecular weight 
or pi. 



Protein quantitation. ( 35 S]mcthioninc-labclcd gels were exposed to X-ray film 
overnight, and then the silver stain and film were used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
0.6-ml microcentrifuge tubes, and scintillation cocktail (100 pi) was added. The 
samples were vortcxed and counted. In addition, two parallel gels were electro- 
blotted to polyvinylidcne difluoridc membranes. The membranes were exposed 
to X-ray film, and four intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmolof protcin/mcthioninc was found. This number was used to quantiiatc 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equilibrium, parallel 2D gels were 
prepared and run on yeast metabolicalry labeled for 1, 2, 6, or 18 h. The 
corresponding 156 spots were excised from each gel, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation or codon bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein half-lives were calculated 
based on the N-cnd rule (33). When the N-tcrminal processing was not known 
experimentally, it was predicted based on the affinity of methionine aminopep- 
tidasc (31). 

RESULTS 

Characteristics of proteome approach. Nearly every facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the eluting peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysate (40 u.g), 
metabolically labeled with [^methionine, was electro- 
phoretically separated by isoelectric focusing in the first dimen- 
sion and by SDS-10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pi, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2/TDH3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D gel. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications), and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e., 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
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FIG. 3. Tandem mass (MS/MS) spectra resulting from analysis of a single spot on a 2D gel. The first quadrupolc selected a single mass-to-chargc ratio (mfz) of 687.2 
(A) or S92.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by CID was applied. The third 
quadrupolc scanned the mass range from 50 to 1.400 mlz. The computer program Scquest (8) was utilized to match MS/MS spectra to amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, S57593 (yeast hypothetical protein YMR226Q. Five other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
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gene ORFs is presented in Fig. 4A. The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C. No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels* The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

, DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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FIG. 4. Current proteomc analysis technology utilizing 2DE without prcen- 
richmcnt samples mainly highly expressed and long- lived proteins. Genes encod- 
ing highly expressed proteins generally have targe codon bias values. (A) Distri- 
bution of the yeast genome (more than 6.000 genes) based on codon bias. The 
interval with the largest frequency of genes is 0.0 to 0.1, with more than 2,500 
genes. (B) Distribution of the genes from identified proteins in this study based 
on codon bias. No genes with codon bias values less than 0.1 were detected in this 
study. (C) Distribution of identified proteins in this study based on predicted 
half-life (estimated by N-cnd rule). 



mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. The techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) is much more time- 
consuming because proteins are analyzed sequentially one by 
one and is not general because analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
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FIG 5. Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source. mRNA and protein levels were 
calculated as described in Materials and Methods. The data represent a population of genes with protein expression levels visible by silver staining on a 2D gel chosen 
to include the entire range of molecular weights, isoelectric focusing points, and staining intensities. The inset shows the low-end portion of the main figure. It contains 
69% of the original data set. The Pearson product moment correlation for the entire data set was 0.935. The correlation for the inset containing 73 proteins (69%) was 
only 0 .356. 



staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Third, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include translational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequency tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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FIG. 6. Effect of highly abundant proteins on Pearson product moment correlation coefficient for mRNA and protein abundance in ycasl. The set of 106 genes was 
ranked according to protein abundance, and the correlation value was calculated by including the 40 lowest-abundance genes and then progressively including the 
remaining 66 genes in order of abundance. The correlation value climbs as the final 11 highly abundant proteins arc included. 



than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
ing with [ 35 S]methionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine amino peptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometry techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 |xg) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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FIG. 7. Relationship between codon bias and protein and mRNA levels in this study. Yeast mRNA and protein expression levels were calculated as described in 
Materials and Methods. The data represent the same 106 genes as in Fig. 5. 



activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
EN01_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, EN01_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFT1, TIF1, HXK2, GSP1, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4 A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 
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iow-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies* If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
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CHICKEN INTERLEUKIN-15 AND USES 
THEREOF 

This is a divisional of U.S. patent application Sen No. 
09/368,613 filed Aug. 4, 1999, which issued as U.S. Pat. No. 5 
6,287,554 and is a divisional of U.S. patent application Ser. 
No. 08/729,004, filed on Oct. 10, 1996 and now issued as 
U.S. Pat. No. 6,190,901. U.S. patent application Ser. No. 
08/729,004 claims priority to U.S. provisional patent appli- 
cation Serial No. 60/005,682 filed on Oct. 17, 1995. Each of 10 
these prior applications is hereby incorporated herein by 
reference, in its entirety. 

FIELD OF INVENTION 

The present invention pertains to isolated genes encoding 
avian interleukin-15 and to purified interleukin-15 polypep- 
tides. 

BACKGROUND OF THE INVENTION 

20 

Most chickens produced in developed countries for con- 
sumption and egg-laying (at least 10 billion per year) are 
vaccinated to protect them against Marek's disease. All of 
the egg-laying chickens and breeder stocks are also vacci- 
nated with Newcastle Disease Virus, Infectious Bursal Dis- 25 
ease Virus, Infectious Bronchitis Virus, Fowlpox Virus and 
Coccidial vaccines. For optimal protection, Marek's vacci- 
nation is performed either at or before hatching. One 
obstacle to the development of efficacious pre-hatching and 
at-hatching vaccination regimens is that the embryonic and 30 
newly hatched avian immune system is not fully developed 
and cannot mount as effective an immune response to the 
immunogen as at 2-3 weeks after hatching. Thus, there is a 
need in the art for agents and compositions that enhance the 
effectiveness of pre- and post -hatching avian vaccines. 35 

Interleukin-2 and interleukin-15 are related cytokines that 
stimulate the activity and proliferation of T cells in mam- 
mals. Though IL-2 and IL-15 both interact with the p and y 
chains of the IL-2 receptor, and may share some elements of 
tertiary structure, the two polypeptides are not homologous 40 
and represent distinct gene products. 

The genes encoding IL-15 from several different mam- 
malian species share a high degree of homology. For 
example, human and simian' IL-15 share 97% amino acid 
homology. By contrast, chicken IL-15, which is the subject 45 
of the present invention, shares only 25% amino acid 
identity with mammalian IL-15. Another distinguishing 
characteristic of chicken IL-15 is that it (and not the mam- 
malian forms) is produced by mitogen-activated spleen 
cells. Accordingly, the discovery of chicken IL-15 and the 50 
finding that it possesses T cell-stimulatory activity provide 
a novel reagent for vaccine augmentation in avian species. 
Without wishing to be bound by theory, the bioactivity of 
mammalian IL-15 in stimulating skeletal muscle develop- 
ment suggests that avian IL-15s are also useful in stimulat- 55 
ing growth in avian species. 

SUMMARY OF THE INVENTION 

The present invention provides isolated and purified DNA go 
encoding avian interleukin-15 (IL-15), as well as cloning 
and expression vectors comprising IL-15 DNA and cells 
transformed with IL-15-encoding vectors. Avian species 
from which IL-15 may be derived include without limitation 
chicken, turkey, duck, goose, quail and pheasant. 6 5 

The invention also provides isolated and purified avian 
IL-15 polypeptide, the native secreted or mature form of 



,522 B2 

2 

which has a molecular mass of about 14 kDa, an isoelectric 
point of about 6.57, a net charge of -2, and a hydrophilicity 
index of 0.278, and which has the ability to stimulate 
mitogen-activated avian T cells and to promote the growth 
of other cell types. IL-15 according to the present invention 
may be obtained from native or recombinant sources. 

Also encompassed by the invention are sequence- 
conservative and function-conservative variants of avian 
IL-15 DNA and IL-15 polypeptides, including, for example, 
a bioactive IL-15 sequence or sub-fragment that is fused 
in-frame to a purification sequence. 

In another aspect, the invention provides a method for 
enhancing an immune response in fowl to an immunogen, 
which is achieved by administering the immunogen before, 
after, or substantially simultaneously with avian IL-15 in an 
amount effective to enhance the immune response. 

In yet another aspect, the invention provides a vaccine for 
inducing an immune response in fowl to an immunogen, 
comprising the immunogen and an effective amount of avian 
interleukin-15 for immune response enhancement. The 
immunogen may be derived, for example, from avian patho- 
gens such as Marek's Disease Virus, Newcastle Disease 
Virus, Infectious Bursal Disease Virus, Infectious Bronchitis 
Virus, and the like. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is an illustration of an 845 nt sequence including 
747 nt of cDNA sequence encoding chicken interleukin-15 
(IL-15) SEQ ID NO:l. 

FIG. 2 is an illustration of a 143-amino acid sequence 
corresponding to the chicken interleukin-15 precursor 
polypeptide (SEQ ID NO:2). 

DETAILED DESCRIPTION OF THE 
INVENTION 

All patent applications, patents, and literature references 
cited in this specification are hereby incorporated by refer- 
ence in their entirety. In case of conflict, the present 
description, including definitions, will control. 

The present invention encompasses interleukin-15 (IL- 
15) from avian species. The invention provides isolated and 
purified nucleic acids encoding avian IL-15, as well as IL-15 
polypeptides purified from either native or recombinant 
sources. Avian IL-15 produced according to the present 
invention may be used in commercial fowl cultivation to 
promote growth and to enhance the efficacy of avian vac- 
cines. 

Nucleic Acids, Vectors, Transform ants 

The sequence of the cDNA encoding chicken IL-15 is 
shown in FIG. 1 (SEQ ID NO:l), and the predicted amino 
acid sequence of chicken IL-15 is shown in FIG. 2 (SEQ ID 
NO:2). The designation of this avian polypeptide as IL-15 is 
based on partial amino acid sequence homology to mam- 
malian IL-15 and the ability of the polypeptide to stimulate 
mitogen-activated T cells (see below). Furthermore, without 
wishing to be bound by theory, it is predicted that avian 
IL-15 polypeptides also exhibit one or more of the following 
bioactivities: activation of NK (natural killer) cells, stimu- 
lation of B-Cell maturation, proliferation of mast cells, and 
interaction with the beta and gamma subunits of the IL-2 
receptor. 

Because of the degeneracy of the genetic code (i.e., 
multiple codons encode certain amino acids), DNA 
sequences other than that shown in FIG. 1 can also encode 
the chicken IL-15 amino acid sequences shown in FIG. 2. 
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Such other DNAs include those containing "sequence- response elements, signal sequences, polyadenylation 

conservative" variations in which a change in one or more sequences, introns, 5 r - and 3'- noncoding regions, and the 

nucleotides in a given codon results in no alteration in the like. Transcriptional regulatory elements that may be oper- 

amino acid encoded at that position. Furthermore, a given a bly linked to avian IL-15 polypeptide DNA sequences) 

amino acid residue in a polypeptide can often be changed 5 include without limitation those that have the ability to 

without altering the overall conformation and function of the direct the expression of genes derived from prokaryotic 

native polypeptide. Such "function-conservative" variants ^ eu karyotic cells, viruses of prokaryotic cells, viruses of 

include, but are not limited Jo, replacement of an amino acid eukaryotic ^ and any combination thereof. Other useful 

with one having similar physico-chemical properties, such heterologous sequences are known to those skiUed in the art. 

as, tor example, acidic, basic, hydrophobic, and the like < rt ^ ,..„-, . « . 

(e.g., replacement of lysine with arginine, aspartate with 10 u nt °^ lei t ° acids of th u e P res f n J invent " )n caD be mod ^ ed 

glutamate, or glycine with alanine). In addition, amino acid by methods known to those skdkd in the art to alter their 

sequences may be added or deleted without destroying the stablht * solubility, binding affinity, and specificity. For 

bioactivity of the molecule. For example, additional amino exam ? le ' ih * can be selectively methylated. The 

acid sequences may be added at either amino- or carboxy- „ nuc \™ sequences of the present invention may also be 

terminal ends to serve as purification tags, (i.e., to allow 15 modl ? ed a labeI ? a P able 1 °* Priding a detectable 

one-step purification of the protein, after which they may be sl S Da !' either du ~ e ctl y or ^directly Exemplary labek include 

chemically or enzymatically removed). Alternatively, the radioisotopes, AWscent molecules, biotm, and the like, 

additional sequences may confer an additional cell-surface ^ present invention also provides vectors that include 

binding site or otherwise alter the target cell specificity of _ n nucleic acids encoding the avian IL-15 polypeptide(s). Such 

IL_i5. vectors include, for example, plasmid vectors for expression 

The chicken IL-15 cDNAs within the scope of the present in a variet y of eukaryotic and prokaryotic hosts. Preferably, 

invention are those of FIG. 1, sequence-conservative variant al f° include a P rom oter operably linked to the avian 

DNAs, DNA sequences encoding function-conservative T L ^ P ol yP e Pfide encoding portion. The encoded avian 

variant polypeptides, and combinations thereof. The inven- IL " 15 Polypeptides) may be expressed by usmg any suitable 

tion encompasses fragments of avian interleukin-15 that ^ ectors ^ host t ^ U j\ as explained herem or otherwise 

exhibit a useful degree of bioactivity, either alone or in known to those skmcd m the art ' 

combination with other sequences or components. As Vectors will often include one or more replication systems 

explained below, it is weU within the ordinary skill in the art for cloning or expression, one or more markers for selection 

to predictively manipulate the sequence of IL-15 and estab- 30 m the host such as, for example, antibiotic resistance, and 

lish whether a given avian IL-15 variant possesses an one or more expression cassettes. The inserted coding 

appropriate stability and bioactivity for a given application. sequences may be synthesized, isolated from natural 

This can be achieved by expressing and purifying the variant sources, prepared as hybrids, or the like. Ligation of the 

IL-15 polypeptide in a recombinant system and assaying its coding sequences to the transcriptional regulatory sequences 

T-cell stimulatory activity and/or growth-promoting activity 35 ma y be achieved by methods known to those skilled in the 

in cell culture and in animals, followed by testing in the art - Suitable host cells may be transformed/transfected/ 

application. infected by any suitable method including electroporation, 

The present invention also encompasses IL-15 DNAs CaQ 2 - or liposome-mediated DNA uptake, fungal infection, 

(and polypeptides) derived from other avian species, includ- microinjection, microprojectile, or the like, 

ing without limitation ducks, turkeys, pheasants, quail and 40 Suitable vectors for use in practicing the present invention 

geese. Avian IL-15 homologues of the chicken sequence include without limitation YEp352, pcDNAI (In Vitrogen, 

shown in FIG. 1 are easily identified by screening cDNA or San Diego, Calif.), pRc/CMV (InVitrogen), and pSFVl 

genomic libraries to identify clones that hybridize to probes (GIBCO/BRL, Gaithersburg, Md.). One preferred vector for 

comprising all or part of the sequence of FIG. 1. use in the invention is pSF VI. Suitable host cells include £. 

Alternatively, expression libraries may be screened using 45 Co//, yeast, COS cells, PCI 2 cells, CI 10 cells, GII4C1 cells, 

antibodies that recognize chicken IL-15. Without wishing to BHK-21 cells, and amphibian melanophore cells. BHK-21 

be bound by theory, it is anticipated that IL-15 genes from cells are a preferred host cell line for use in practicing the 

other avian species will share at least about 70% homology present invention. 

with the chicken IL-15 gene. Also within the scope of the Nucleic acids encoding avian IL-15 polypeptide(s) may 

invention are DNAs that encode chicken homologues of 50 also be introduced into cells by recombination events. For 

IL-15, defined as DNA encoding polypeptides that share at example, such a sequence can be microinjected into a cell, 

least about 25% amino acid identity with chicken IL-15. effecting homologous recombination at the site of an endog- 

Generally, nucleic acid manipulations according to the enous gene encoding the polypeptide, an analog or pseudo- 
present invention use methods that are well known in the art, gene thereof, or a sequence with substantial identity to an 
such as those as disclosed in, for example, Molecular 55 avian IL-15 polypeptide-encoding gene. Other 
Cloning, A Laboratory Manual (2nd Ed., Sambrook, Fritsch recombination -based methods such as non-homologous 
and Maniatis, Cold Spring Harbor), or Current Protocols in recombinations, and deletion of endogenous gene by 
Molecular Biology (Eds. Aufubel, Brent, Kingston, More, homologous recombination, especially in pluripotent cells, 
Feidman, Smith and Stuhl, Greene Publ. Assoc., Wiley- may also be used. 
Interscience, NY, N.Y., 1992). 6 o IL-15 Polypeptides 

The present invention encompasses cDN A and RN A The chicken IL-15 gene (the cDNA of which is shown in 

sequences and sense and antisense sequences. The invention FIG. 1) encodes a polypeptide of 143 amino acids (FIG. 2). 

also encompasses genomic avian IL-15 polypeptide DNA Without wishing to be bound by theory, by comparison with 

sequences and flanking sequences, including, but not limited simian IL-15, and by use of an accepted procedure to predict 

to, regulatory sequences. Nucleic acid sequences encoding 65 signal peptidase cleavage sites (Von Heijne, Nuc Acids Res., 

avian IL-15 polypeptide(s) may also be associated with 14:4683, 1986), it is predicted that an aminoterminal leader 

heterologous sequences, including promoters, enhancers, sequence of about 22 amino acids (secretion signal peptide) 



US 6,737,522 B2 

5 6 

is cleaved from the primary translation product to produce Anti-avian IL-15 antibodies may be used to identify and 

mature IL-15. The predicted mature sequence of 121 amino quantify avian IL-15, using immunoassays such as ELISA, 

acids is further characterized by a predicted molecular RIA, and the like. Anti-avian IL-15 antibodies may also be 

weight of 13,971 daltons; an isoelectric point of 6.57; four use d to immunodeplete extracts of avian IL-15. In addition, 

cysteine residues (at amino acids numbers 63, 70, 116, and 5 these antibodies can be used to identify, isolate, and purify 

119 in the precursor IL-15 shown in FIG. 2) that correspond av j a n IL-15s from different sources, and to perform subcel- 

to four cysteines conserved among human, mouse, and i u i ar an( j histochemical localization studies, 

monkey IL-15 and that are believed to participate in Applications 

intramolecular disulfide bonding; and one consensus site for Avian IL-15 produced according to the present invention 

N-linked glycosylation (at asparagine 110 of the sequence 10 can be used beneficially in homologous or heterologous 

shown in FIG. 2) which corresponds to a similar site in species, for example, to stimulate activated T-cells 

human IL-15. (Grabstein et al., Science, 264:965, 1994) and B-cells 

Purification of IL-15 from natural or recombinant sources (Armitage et al., J. Immunol., 154:483, 1995) and/or to 

may be achieved by methods well-known in the art, includ- promote the growth of non-immune cells, such as, for 

ing without limitation ion-exchange chromatography, 15 example, muscle cells (Quinn et al., Endocrinol. 136:3669, 

reverse-phase chromatography on C4 columns, gel filtration, 1995). 

isoelectric focusing, affinity chromatography, iramunoaffin- Vaccines 

ity chromatography, and the like. In a preferred embodiment, The present invention encompasses methods and compo- 

large quantities of bioactive IL-15 may be obtained by sitions for enhancing the efficacy of an immune response in 

constructing a recombinant DNA sequence comprising the 20 avian species. In this embodiment, avian IL-15 is used in 

coding region for IL-15 fused in frame to a sequence conjunction with an immunogen for which it is desired to 

encoding 6 C-terminal histidine residues in the pSFVl elicit an immune response. For example, in avian vaccines, 

replicon (GIBCO/BRL). mRNA encoded by this plasmid is such as those against Marek's disease, Newcastle Disease 

synthesized using techniques well-known to those skilled in Virus, and other pathogens such as Infectious Bursal Disease 

the art and introduced into BHK-21 cells by electroporation. 25 Virus and Infectious Bronchitis Virus, it is desirable to 

The cells synthesize and secrete mature glycosylated IL-15 include avian IL-15 in the vaccine to enhance the magnitude 

polypeptides containing 6 C-terminal histidines. The modi- and quality of the immune response. For this purpose, IL-15 

fied IL-15 polypeptides are easily purified from the cell purified from native or recombinant sources as described 

supernatant by affinity chromatography using a histidine- above is included in the vaccine formulation at a concen- 

binding resin (His-bind, Novagen, Madison, Wis.). 30 tration ranging from about 0.01 /*g to about 1.0 fig per 

Avian IL-15 polypeptides isolated from any source can be vaccine per chicken, 

modified by methods known in the art. For example, avian IL-15 may be administered in conjunction with a live (i.e., 

IL-15 may be phosphorylated or dephosphorylated, glyco- replicating) vaccine or a non-replicating vaccine. Non- 

sylated or deglycosylated, and the like. Especially useful are limiting examples of replicating vaccines are those compris- 

modifications that alter avian IL-15 solubility, stability, and 35 ing native or recombinant viruses or bacteria, such as 

binding specificity and affinity. modified turkey herpesvirus or modified fowlpox virus. 

Anti-IL-15 Antibodies Non-limiting examples of non-replicating vaccines are those 

The present invention encompasses antibodies that are comprising killed or inactivated viruses or other 

specific for avian IL-15 polypeptides identified as described microorganisms, or crude or purified antigens derived from 

above. The antibodies may be polyclonal or monoclonal, 40 native, recombinant, or synthetic sources, such as, for 

and may discriminate avian IL-15s from different species, example, coccidial vaccines. Commercial sources for avian 

identify functional domains, and the like. Such antibodies vaccines include without limitation: Rhone Merieux 

are conveniently made using the methods and compositions Laboratoire-IFFA (Lyon, France); Intervet International BV 

disclosed in Harlow and Lane, Antibodies, A Laboratory (Boxmeer, The Netherlands); Mallinckrodt Veterinary; 

Manual, Cold Spring Harbor Laboratory, 1988, other refer- 45 Solvay Animal Health (Mendota Heights, Minn.); Hoechst- 

ences cited herein, as well as immunological and hybridoma Roussel (Knoxville, Tenn.); and Nippon Zeon Co., Ltd. 

technologies known to those in the art. Where natural or (Kawasaki-Kiu, Japan). 

synthetic avian IL-15 -derived peptides are used to induce an In one embodiment, the gene encoding IL-15 is incorpo- 

avian IL-15-specific immune response, the peptides may be rated into a recombinant virus, which is then formulated into 

conveniently coupled to a suitable carrier such as KLH and 50 a live vaccine. The IL-15 gene is incorporated into the virus 

administered in a suitable adjuvant such as Freund's. so that its expression is controlled by an appropriate pro- 

Preferably, selected peptides are coupled to a lysine core moter. Administration of the vaccine results in the expres- 

carrier substantially according to the methods of Tarn (1988) sion of bioactive IL-15 in close temporal and spatial prox- 

Proc. Natl. Acad. Sci. USA, 85:5409-5413. The resulting imity to the desired immune response, thus enhancing the 

antibodies may be modified to a monovalent form e.g. Fab, 55 vaccine's efficacy. 

FAB\ or FV. Anti-idiotypic antibodies, especially internal IL-15 may be administered to birds as part of a vaccine 

imaging anti-idiotypic antibodies, may also be prepared formulation either before or after hatching, preferably before 

using known methods. hatching, using methods known in the art such as those 

In one embodiment, purified avian IL-15 is used to described in U.S. Pat. Nos. 5,034,513 and 5,028,421. 

immunize mice, after which their spleens are removed, and 60 Growth Promotion 

splenocytes used to form cell hybrids with myeloma cells to The present invention provides methods and composi- 
obtain clones of antibody-secreting cells according to tech- tions for enhancing the growth of avian species for medical 
niques that are standard in the art. The resulting monoclonal and/or commercial purposes. In this embodiment, IL-15 is 
antibodies secreted by such cells are screened using in vitro administered to birds using any appropriate mode of admin- 
assays for the following activities: binding to avian IL-15, 65 istration. For growth promotion, IL-15 is administered in 
inhibiting the receptor-binding activity of IL-15, and inhib- amounts ranging from about 0.25 /<g/kg/day to about 25 
iting the T-cell stimulatory activity of IL-15. /ig/kg/day. It will be understood that the required amount of 
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IL-15 can be determined by routine experimentation well- Model 373A sequencing system. The flanking T7 and SP6 

known in the art, such as by establishing a matrix of dosages primers contained in the pcDNAl vector were used to prime 

and frequencies and comparing a group of experimental the sequencing reaction. Two of the clones, B2.16.2 and 

units or subjects to each point in the matrix. M2.12.1, were identical and coded for the cDNA sequence 

According to the present invention, native or recombinant 5 shown in FIG. 1. Clone F19.84 was similar to those two 

avian IL-15 may be formulated with a physiologically clones, but was missing the 20 nt at its 5' end (i.e., starting 

acceptable carrier, such as, for example, phosphate buffered at the first ATG of the coding region) and contained a poly 

saline or deionized water. The formulation may also contain T tail of at least 100 nt at its 3* end. 
excipients, including lubricant(s), plasticizer(s), colorant(s), The entire 747 nt sequence (FIG. 1, SEQ ID NO:l) was 

absorption enhancers), bactericide(s), and the like that are 10 analyzed using a BLAST search (which accesses all of the 

well-known in the art. The IL-15 polypeptide of the inven- ma j or international nucleotide data banks). No significant 

tion may be administered by any effective means, including homology was detected with any other known sequence. The 

without limitation intravenous, subcutaneous, sequence was also analyzed using the Mac Vector software 

intramuscular, transmucosal, topical, or oral routes. For program (MacVector 4.0; International Biotechnologies, 

subcutaneous administration, for example, the dosage form 15 \ nc ^ N ew Haven, Conn.) on a Mac Ilci computer. This 

may consist of IL-15 in sterile physiological saline. For oral analysis revealed an open reading frame flanked at its 5' end 

administration, IL-15, with or without excipients, may be by a Kozak consensus sequence for translation initiation, 

micro- or macro-encapsulated in, e.g., liposomes and micro- The predicted amino acid sequence of this open reading 

spheres. Dermal patches (or other slow-release dosage frame & sno wn in FIG. 2 (Seq ID NO:2). This amino acid 

forms) may also be used. 20 sequence was analyzed using a BLASTP search (which 

The following examples are intended to further illustrate accesses all of the major international protein data banks) 

the invention without limiting its scope thereof. revealing significant homology with monkey and human 

precursor IL-15. 
The predicted amino acid sequence of chicken IL-15 

Cloning of the Chicken IL-15 Gene 2$ consists of a 143 amino acid Peptide having a predicted 

molecular weight of 16,305 and an isoelectric point of 6.37. 

To clone chicken IL-15, a chicken spleen cell cDNA Based on the hydrophobicity of its amino terminal end and 

library derived from spleen cells that had been activated by comparison with known signal peptide cleavage sites 

with concanavalin A was utilized (Kaplan, J. Immunol ( von Heijne, Nucleic Acids Res. 14:4683, 1986) it is pre- 

151:628, 1993). 5000 colonies were grown overnight at 35° 30 dieted that cleavage between glycine-22 and alanine-23 

C. on LB agar plates containing 30/ig/ml ampicillin and 10 results in the removal of an aminoterminal leader sequence 

/ig/ml tetracycline. 15-20 colonies were pooled and trans- of about 22 amino acids (secretion signal peptide) from the 

ferred to 10 ml Terrific Broth (containing the same primary product to produce mature IL-15. 
antibiotics) and grown overnight. Plasmid DNA from each The predicted mature IL-15 sequence of 121 amino acids 

pool was then isolated by published procedures (Maniatis, has a predicted molecular weight of 13,971, an isoelectric 

Section 1.28), treated with RNAase (10 //g/ml), and stored po i nt 0 f 6.57, and a possible N-linked glycosylation site (at 

in TE buffer. asparagine 110 of FIG. 2). Comparisons between the pre- 

The plasmid DNAs were transfected into COS-7(ATCC) dieted amino acid sequences of IL-15 from monkey, human, 

cells using Lipofectamine (GIBCO/BRL, Gaithersburg, 4Q mouse and chicken and analysis of the tertiary structure of 

Md.). 1 fig of each plasmid pool was mixed with 3 fi\ monkey IL-15 (Grabstein, Science, 264:965, 1994) suggest 

Lipofectamine in 100 /d Opti-MEM medium (GIBCO/ that four cysteines in chicken IL-15 (positions 63, 70, 116 

BRL), incubated for 30 min, and then placed on COS-7 cells and 119 of precursor IL-15, FIG. 2) are conserved and form 

that had been grown to 80-90% confluence in 12-well plates intrachain disulfide bonds, 
and rinsed in serum-free medium. The cells and DNA were 

incubated for 5 hrs at 37° C. with Dulbecco's MEM in the EXAMPLE 2 

absence of serum and antibiotics, and then supplemented 

with the same medium containing 10% fetal calf serum and Bioactivity Assay for Chicken IL-15 

incubated overnight at 37° C. The next day, the medium was Bioactivity assays for IL-15 are performed as follows: 

replaced with Dulbecco's MEM containing 10% fetal calf 5Q Concanavalin A (ConA)-activated splenic T cells are pre- 

serum, penicillin, and streptomycin. After an additional 24 pare a by incubating chicken spleen cells (10 7 cells/ml) with 

hrs of incubation, the medium was collected and stored at Con A ( 10 ^ m{ ) (Si gma Chemical Co., St. Louis, Mo.) in 

~ 20 ° C - RPMI 1640 medium (Sigma) containing 2 mg/ml BSA, 

The cell supernatants were tested for IL-15 activity as antibiotics and glutamine at 40° C. for 24 hrs. The medium 

described in Example 2 below. Five pools with the highest 55 is then replaced with Iscoves' medium (Sigma) containing 

stimulation indices (1.6 to 2.1) exhibited levels of activity 2% normal chicken serum (Sigma) and 0.05M alpha-methyl 

that were greater than 2 standard deviations from the mean pyrannoside (Sigma) for an additional 2-4 days, diluting the 

of the remaining 278 pools. Three of the five pools remained cells in additional medium as needed. Blast cells are purified 

positive in a second screen, and were subdivided into pools from this mixture by gently layering them on a Histopaque 

of 6. Plasmid DNA extracted from the secondary pools was 60 density gradient (Sigma) and centrifuging them according to 

used to transfect COS-7 cells and the supernatants were the manufacturer's instructions. The cells are then washed 

tested for IL-2-like activity. As described below in Example three times and finally resuspended in assay medium 

2, three positive pools were identified and subdivided to (Iscoves' containing 2% normal chicken serum (Sigma)), 
yield individual clones; from each pool at least one positive For the assay, 2xl0 4 blast cells are placed in roundbottom 

clone was isolated. 65 96 we n p i a tes in assay medium containing IL-15 (such as, 

The complete cDNA inserts of all three positive clones e.g., dilutions of supernatant from transfected COS-7 cells) 

were sequenced using the automated Applied Biosystems or appropriate controls. After overnight incubation at 40° C, 
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the cells are pulsed for 6 hrs with 3 H-thymidine (0.5 /iCi) 
(New England Nuclear, Boston, Mass.)+fluorodeoxyuridine 
(1(T 6 M) (Sigma). The cells are then harvested on glass fiber 
filters (Whatman, Clifton, NJ.), and the radioactivity is 
measured in a liquid scintillation counter. IL-15 is expressed 
as a stimulation index, which is the radioactivity in experi- 
mental samples — the radioactivity in controls (non- 
transfected COS-7 supernatants). A typical result is shown in 
Table 1. 



10 



mg total protein, with up to 5% comprising a recombinantly 
expressed and secreted protein. This corresponds to approxi- 
mately 1 .25 mg of cIL-15. 

EXAMPLE 4 

Use of Avian IL-15 in Vaccines 

The following experiments are performed to evaluate the 
immune-enhancing activity of chicken IL-15 in chicken 
vaccines. 



TABLE 1 



SOURCE 

OF PLASMID DNA Stimulation indices 



DNA Designation 1/10 dU* 1/10 dil b 1/33 dU b 1/100 dU b 



PRIMARY 


A19 


1.6 


1.9 


1.3 


1.2 


POOLS 


B2 


Zl 


4.2 


2.3 


1.7 




E7 


1.8 


1.7 


1.5 


0.9 




F19 


1.8 


3.5 


2.0 


1.2 




M2 


1.7 


3.2 


1.9 


1.3 




Ave. of 278 * SD 


1.1 * 0.1 










Ave. of 3 Neg. pools 




1.4 


1.3 


1.1 


SECONDARY 


A19.7 




0.7 


1.9 




POOLS 


B2.16 




6.0 


3.5 






F19.8 




9.8 


3.4 






M2.12 




3.2 


2.2 




INDIVIDUAL 


B2.16.2 




6.6 


3.3 


2.7 


CLONES 


F19.8.4 




7.5 


4.0 


3.0 




M2.12.1 




7.2 


3.9 


3.6 



"First screening at 1/10 dil. 

b A repeat transfection using 5 positive and 3 negative primary pools 



EXAMPLE 3 
Expression and Purification of IL-15 

35 

To obtain high-level expression of chicken IL-15 in 
mammalian cells, the pSFVl eukaryotic expression vector 
(which includes the Semliki Forest Virus replicon) is used 
(GIBCO/BRL, Gaithersburg, Md.). Use of this vector allows 
for signal peptide cleavage, glycosylation, and secretion of ^ 
mature active protein. In one embodiment, the recombinant 
vector encodes an additional six histidine residues at the 
carboxy terminus of the native IL-15 sequence, allowing the 
efficient single step purification of the secreted protein on a 
nickel column (Novagen, Madison, Wis.). 45 

Primers were constructed that include 5' and 3' sequences 
flanking the coding region of IL-15 cDNA. The 3' primer 
also includes nucleotides coding for 6 histidines. These 
primers were used in polymerase chain reaction (PCR), 
using as a template the entire IL-2 cDNA contained within 50 
the pcDNAl plasmid. The resulting amplified cDNA, 
including the histidine-coding sequences, was ligated into 
the pSFVl plasmid (GIBCO/BRL). The plasmid was 
obtained by transforming DH5 E. coli (GIBCO/BRL) and 
selecting transformants on agar plates and broth containing 55 
ampicillin. 

This plasmid is used as a template to produce mRNA in 
vitro, using manufacturer's protocols. The mRNA is trans- 
fected into BHK-21 cells by electroporation, using 10 /*g 
RNA per 10 7 cells, after which the cells are incubated for 60 
1-3 days. The cell supernatant is harvested and passed 
through a resin matrix (His-Bind resin; Novagen, Madison, 
Wis.) using a suitable buffer system (His-bind buffer kit; 
Novagen). Up to 20 mg of tagged protein can be purified on 
a single 2.5 ml column. The IL-15 is eluted from the column 65 
with the elution buffer provided in the kit. It is estimated that 
BHK-21 cells growing in 50 ml medium synthesize about 25 



Chicken IL-15 cDNA is inserted into two viral vectors 
(derived from turkey herpesvirus and fowlpox virus, 
respectively) that are used for the expression of recombinant 
proteins in chickens (Morgan et al., Avian Diseases, 36:858, 
1992; Yanagida et al., J. Virol, 66:1402, 1992; Nazerian et 
al, J. Virol, 66:1409, 1992). These IL-15-modified live viral 
vectors are administered to newly hatched chicks simulta- 
neously with the administration of various vaccines cur- 
rently available. Six days later the chicks are challenged 
with the corresponding virulent viruses and observed for 8 
weeks for the development of disease. The incidence of 
disease in these chicks is compared with controls that do not 
receive the IL-15-modified live viral vectors. A sample 
protocol (including expected results) is shown in Table 2. 



TABLE 2 





Treatment 


Challenge 


% expected 


Group # 


on day 1 


at day 6 


with disease 


1 


none 


none 


0 


2 


none 


virulent Marek's 


>80% 


3 


HVT (not modified) 


virulent Marek's 


20% 


4 


HVT-IL-15" 


virulent Marek's 


0 to 10% 


5 


HVT (not modified) + 


virulent Marek's 


0 to 10% 




HVT-IL-15 






6 


none 


virulent NDV 


>80% 


7 


HVT-IF-15 


virulent NDV 


30% to >50% 


8 


NOV vaccine 


virulent NDV 


20% 


9 


NDV vaccine + 


virulent NDV 


0 to 10% 




HVT-IL-15 







herpesvirus of turkeys expressing IL-15 



In an alternative procedure, newly hatched chicks are 
injected intramuscularly with 100 //g of a plasmid containing 
cDNA for chicken IL-15, using the methods described in 
Ulmer, J. B. Science, 259:1745-1749, 1993. These chicks, 
and control chicks receiving a control vector lacking IL-15 
cDNA, are vaccinated on day 2 with chicken vaccines and 
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then challenged on day 7 with the corresponding virulent 
viruses. They are observed for 8 weeks for signs of disease. 
It is expected that chicks injected with the pcDNAi vector 
containing IL-15 cDNA will exhibit a reduced incidence of 
disease relative to controls. 

Finally, IL-15 protein purified by the procedure described 
in Example 3 is administered intramuscularly to chicks at 
hatching, followed by a single daily administration on each 
of the following four days. Chicks are divided into three 
groups, receiving 0.01, 0.1 or 1.0 fig per injection per day. 
A fourth group receives placebo injections. At hatching all 
chicks are vaccinated with chicken vaccines and then chal- 
lenged on day 7 with the corresponding virulent viruses. 
They are then observed for 8 weeks for signs of disease. It 
is expected that chicks injected with IL-15 will exhibit a 
reduced incidence of disease relative to controls. 
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EXAMPLE 5 



Use of Avian IL-15 in Growth Promotion 

Mammalian IL-15 stimulates muscle growth (Quinn, L. 
S., Endocrin., 136:3669, 1995) and semi-pure chicken IL-2 
stimulates chicken body weight and increases feed conver- 
sion (U.S. Pat. No. 5,028,421). To evaluate the growth- 
promoting activity of avian IL-15, the methods described in 
Example 4 above may be used to administer IL-15 cDNAin 
a viral or plasmid vectors recombinant IL15 protein. Experi- 
mental and control chicks are monitored for weight gain and 
feed conversion for a period of six weeks. It is expected that 
one or more of these protocols will enhance chicken growth 
over controls. 



SEQUENCE LISTING 

<160> NUMBER OF SEQ ID NOS : 2 

<210> SEQ ID NO 1 
<211> LENGTH: 747 
<212> TYPE: DNA 

<213> ORGANISM: Gallus domes ticus 
<400> SEQUENCE: 1 

cagataactg ggacactgcc atgatgtgca aagtactgat ctttggctgt atttcggtag 60 

coacgctaat gactacagct tatggagcat ctctatcatc agcaaaaagg aaacctcttc 120 

aaacattaat aaaggattta gaaatattgg aaaatatcaa gaacaagatt catctcgagc 180 

tctacacacc aactgagacc caggagtgca cccagcaaac tctgcagtgt tacctgggag 240 

aagtggttac tctgaagaaa gaaactgaag atgacactga aattaaagaa gaatttgtaa 300 

ctgctattca aaatatcgaa aagaacctca agagtcttac gggtctaaat cacaccggaa 360 

gtgaatgcaa gatctgtgaa gctaacaaca agaaaaaatt tcctgatttt ctccatgaac 420 

tgaccaactt tgtgagatat ctgcaaaaat aagcaactaa tcatttttat tttactgcta 480 

tgttatttat ttaattattt aattacagat aatttatata ttttatcccg tggctaacta 540 

atctgctgtc cattctggga ccactgtatg ctcttagtct gggtgatatg acgtctgttc 600 

taagatcata tttgatcctt tctgtaacct acgggctcaa aatgtacgtt ggaaaactga 660 

ttgattctca ctttgtcggt aaagtgatat gtgtttactg aaagaatttt taaaagtcac 720 

ttctagatga catttaataa atttcag 747 



<210> SEQ ID NO 2 
<211> LENGTH: 14 3 
<212> TYPE: PRT 

<213> ORGANISM: Gallus domesticus 
<400> SEQUENCE: 2 

Met Met Cys Lys Val Leu lie Phe Gly Cys He Ser Val Ala Thr Leu 
15 10 15 

Met Thr Thr Ala Tyr Gly Ala Ser Leu Ser Ser Ala Lys Arg Lys Pro 
20 25 30 

Leu Gin Thr Leu He Lys Asp Leu Glu He Leu Glu Asn He Lys Asn 
35 40 45 

Lys He His Leu Glu Leu Tyr Thr Pro Thr Glu Thr Gin Glu Cys Thr 
50 55 60 
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-continued 



Gin 
65 


Gin 


Thr 


Leu 


Gin 


Cys 
70 


Tyr 


Leu 


Gly 


Glu 


Val 

75 


Val 


Thr 


Leu 


Lys 


Lys 
80 


Glu 


Thr 


Glu 


Asp 


Asp 

85 


Thr 


Glu 


He 


Lys 


Glu 
90 


Glu 


Phe 


Val 


Thr 


Ala 

95 


He 


Gin 


Asn 


He 


Glu 
100 


Lys 


Asn 


Leu 


Lys 


Ser 
105 


Leu 


Thr 


Gly 


Leu 


Asn 
110 


His 


Thr 


Gly 


Ser 


Glu 
115 


Cys 


Lys 


He 


Cys 


Glu 
120 


Ala 


Asn 


Asn 


Lys 


Lys 
125 


Lys 


Phe 


Pro 


Asp 


Phe 
130 


Leu 


His 


Glu 


Leu 


Thr 
135 


Asn 


Phe 


Val 


Arg 


Tyr 
140 


Leu 


Gin 


Lys 





What is claimed is: 12. The complement of a nucleic acid according to claim 

1. An isolated nucleic acid which: 11. 

(a) comprises a nucleic acid sequence having at least 70% 13 ^ isolated nucleic acid according to claim 11 or 12, 
sequence homology, determined by a BLAST 20 whlch nucleic acid fc an avian nucleic acid isolated from 

algorithm, to the sequence set forth in nucleotides chicken. 

87_449 of SEO ID NOT and vector construct comprising the nucleic acid of 

' ' claim 8 or 11. 

(b) encodes a polypeptide capable of stimulating thymi- 15 AThe vector construct according to claim 14, in which 
dine incorporation in mitogen activated avian T-cells. 25 sa id nucleic acid is operatively associated with a promoter 

2. The complement of a nucleic acid according to claim 1 . element capable of expressing the nucleic acid in a host cell. 

3. An isolated nucleic acid according to claim 1 or 2, 16. The vector construct according to claim 14, in which 
which nucleic acid is an avian nucleic acid isolated from the construct is a recombinant virus. 

chicken. 17. The vector construct according to claim 16, in which 

4. A vector construct comprising the nucleic acid of claim 30 the recombinant virus is a turkey herpes virus or a fowl pox 
1. virus. 

5. The vector construct according to claim 4, in which said 18. An isolated nucleic acid having an open reading frame 
nucleic acid is operatively associated with a promoter ele- tnat encodes a polypeptide comprising the sequence of 
ment capable of expressing the nucleic acid in a host cell. amino acid residues 23-143 set forth in SEQ ID NO:2 (FIG. 

6. The vector construct according to claim 4, in which the 35 2). 

construct is a recombinant virus. 19. The complement of a nucleic acid according to claim 

7. The vector construct according to claim 6, in which the 18. 

recombinant virus is a turkey herpes virus or a fowl pox 20 An isolated nucleic acid according to claim 18 or 19, 

virus. which nucleic acid is an avian nucleic acid isolated from 

8. An isolated nucleic acid which: 40 chicken. 

(a) hybridizes to the full length of a nucleic acid having 21 /* isolated nucleic acid accoiding to claim 18, 
the complementary sequence of nucleotides 87^49 in w °erein the polypeptide comprises the amino acid sequence 
SEQ ID NO:l under conditions comprising (i) hybrid- ^ or * m SEQ ID NO:2 ( FIG " 2 >' 

ization in 6xSSC and 0.5% SDS, and (ii) washing at 22 ' ^ complement of a nucleic acid according to claim 

68° C. in O.lxSSC and 0.5% SDS; and 45 21 1 ' 

/ux , , c . t . . 23. An isolated nucleic acid according to claim 21 or 22 

(b) encodes a po ypept.de capable of stimulating thymi- which Quc]eic ^ & aQ avian nucleic * acid from 
dme incorporation m mitogen activated avian T-cells. chicken 

9. The complement of a nucleic acid according to claim 8. ~ A A ' " . - . A . , . ., c 
ia a • 1 . j 1 • • j ,. L ; 1 . o ft 24. A vector construct comprising the nucleic acid of 

10. An isolated nucleic acid according to claim 8 or 9, c faim 18 or 21 

which nucleic acid is an avian nucleic acid isolated from 50 ~ c „, ' t . . . ... 

chicken e ve construct according to claim 24, in which 

„ . , A , » j i_ » i_ said nucleic acid is operatively associated with a promoter 

11. An isolated nucleic acid which: , . , , c • . u t • j-i_.it 

element capable of expressing the nucleic acid in a host cell. 

(a) hybridizes to the full length of a nucleic acid having 26. The vector construct according to claim 24, in which 
the complementary sequence of nucleotides 87^49 in the construct is a recombinant virus. 

SEQ ID NO:l under conditions comprising (i) hybrid- 55 2 7. The vector construct according to claim 26, in which 

ization in 6xSSC and 0.5% SDS, and (ii) washing at the recombinant virus is a turkey herpes virus or a fowl pox 

room temperature in 2xSSC and 0.5% SDS; and virus. 

(b) encodes a polypeptide capable of stimulating thymi- 
dine incorporation in mitogen activated avian T-cells. ***** 
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